(57) Abstract

The present invention relates to fatty acid desaturases able to catalyze
the conversion of oleic acid to linoleic acid, linoleic acid to γ-linolenic
acid, or of alpha-linolenic acid to stearidonic acid. Nucleic acid
sequences encoding desaturases, nucleic acid sequences which hybridize
thereto, DNA constructs comprising a desaturase gene, and recombinant host
microorganisms or animals expressing increased levels of a desaturase are
described. Methods for desaturating a fatty acid and for producing a
desaturated fatty acid by expressing increased levels of a desaturase are
disclosed. Fatty acids, and oils containing them, which have been
desaturated by a desaturase produced by recombinant host microorganisms or
animals are provided. Pharmaceutical compositions, infant formulas or
dietary supplements containing fatty acids which have been desaturated by
a desaturase produced by a recombinant host microorganism or animal also
are described.

WO 98/46763 PCT/US98/07126

METHODS AND COMPOSITIONS FOR SYNTHESIS OF 
LONG CHAIN POLYUNSATURATED FATTY ACIDS 

RELATED APPLICATIONS 

This application is a continuation-in-part application of United States
Patent Application Serial No. 08/834,655 filed April 11, 1997.

INTRODUCTION 

Field of the Invention 

This invention relates to modulating levels of enzymes and/or enzyme
components relating to production of long chain poly-unsaturated fatty acids
(PUFAs) in a microorganism or animal.

Background 

Two main families of polyunsaturated fatty acids (PUFAs) are the ω3
fatty acids, exemplified by eicosapentaenoic acid (EPA), and the ω6 fatty
acids, exemplified by arachidonic acid (ARA). PUFAs are important
components of the plasma membrane of the cell, where they may be found in
such forms as phospholipids. PUFAs are necessary for proper development,
particularly in the developing infant brain, and for tissue formation and
repair. PUFAs also serve as precursors to other molecules of importance in
human beings and animals, including the prostacyclins, eicosanoids,
leukotrienes and prostaglandins. Four major long chain PUFAs of importance
include docosahexaenoic acid (DHA) and EPA, which are primarily found in
different types of fish oil, γ-linolenic acid (GLA), which is found in the
seeds of a number of plants, including evening primrose (Oenothera
biennis), borage (Borago officinalis) and black currants (Ribes nigrum),
and stearidonic acid (SDA), which is found in marine oils and plant seeds.
Both GLA and another important long chain PUFA, arachidonic acid (ARA),
are found in filamentous fungi. ARA can be purified from animal tissues
including liver and adrenal gland. GLA, ARA, EPA and SDA are themselves,
or are dietary precursors to, important long chain fatty acids involved in
prostaglandin synthesis, in treatment of heart disease, and in development
of brain tissue.

For DHA, a number of sources exist for commercial production including a
variety of marine organisms, oils obtained from cold water marine fish,
and egg yolk fractions. For ARA, microorganisms including the genera
Mortierella, Entomophthora, Pythium and Porphyridium can be used for
commercial production. Commercial sources of SDA include the genera
Trichodesma and Echium. Commercial sources of GLA include evening
primrose, black currants and borage. However, there are several
disadvantages associated with commercial production of PUFAs from natural
sources. Natural sources of PUFAs, such as animals and plants, tend to
have highly heterogeneous oil compositions. The oils obtained from these
sources therefore can require extensive purification to separate out one
or more desired PUFAs or to produce an oil which is enriched in one or
more PUFA. Natural sources also are subject to uncontrollable fluctuations
in availability. Fish stocks may undergo natural variation or may be
depleted by overfishing. Fish oils have unpleasant tastes and odors, which
may be impossible to economically separate from the desired product, and
can render such products unacceptable as food supplements. Animal oils,
and particularly fish oils, can accumulate environmental pollutants.
Weather and disease can cause fluctuation in yields from both fish and
plant sources. Cropland available for production of alternate
oil-producing crops is subject to competition from the steady expansion of
human populations and the associated increased need for food production on
the remaining arable land. Crops which do produce PUFAs, such as borage,
have not been adapted to commercial growth and may not perform well in
monoculture. Growth of such crops is thus not economically competitive
where more profitable and better established crops can be grown. Large
scale fermentation of organisms such as Mortierella is also expensive.
Natural animal tissues contain low amounts of ARA and are difficult to
process. Microorganisms such as Porphyridium and Mortierella are difficult
to cultivate on a commercial scale.

Dietary supplements and pharmaceutical formulations containing PUFAs can
retain the disadvantages of the PUFA source. Supplements such as fish oil
capsules can contain low levels of the particular desired component and
thus require large dosages. High dosages result in ingestion of high
levels of undesired components, including contaminants. Unpleasant tastes
and odors of the supplements can make such regimens undesirable, and may
inhibit compliance by the patient. Care must be taken in providing fatty
acid supplements, as overaddition may result in suppression of endogenous
biosynthetic pathways and lead to competition with other necessary fatty
acids in various lipid fractions in vivo, leading to undesirable results.
For example, Eskimos having a diet high in ω3 fatty acids have an
increased tendency to bleed (U.S. Pat. No. 4,874,603).

A number of enzymes are involved in PUFA biosynthesis. Linoleic acid
(LA, 18:2 Δ9, 12) is produced from oleic acid (18:1 Δ9) by a
Δ12-desaturase. GLA (18:3 Δ6, 9, 12) is produced from linoleic acid (LA,
18:2 Δ9, 12) by a Δ6-desaturase. ARA (20:4 Δ5, 8, 11, 14) production from
dihomo-γ-linolenic acid (DGLA, 20:3 Δ8, 11, 14) is catalyzed by a
Δ5-desaturase. However, animals cannot desaturate beyond the Δ9 position
and therefore cannot convert oleic acid (18:1 Δ9) into linoleic acid (18:2
Δ9, 12). Likewise, α-linolenic acid (ALA, 18:3 Δ9, 12, 15) cannot be
synthesized by mammals. Other eukaryotes, including fungi and plants, have
enzymes which desaturate at positions Δ12 and Δ15. The major
poly-unsaturated fatty acids of animals therefore are either derived from
diet and/or from desaturation and elongation of linoleic acid (18:2 Δ9,
12) or α-linolenic acid (18:3 Δ9, 12, 15). Therefore it is of interest to
obtain genetic material involved in PUFA biosynthesis from species that
naturally produce these fatty acids and to express the isolated material
in a microbial or animal system which can be manipulated to provide
production of commercial quantities of one or more PUFAs. Thus there is a
need for fatty acid desaturases, genes encoding them, and recombinant
methods of producing them. A need further exists for oils containing
higher relative proportions of and/or enriched in specific PUFAs. A need
also exists for reliable economical methods of producing specific PUFAs.
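
The desaturation and elongation steps named above can be traced with a small model. The following is only an illustrative sketch, not part of the disclosed methods: the `FattyAcid` class and the `desaturate` and `elongate` helpers are hypothetical names, and elongation is assumed to add two carbons at the carboxyl end, which shifts every Δ position by two.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FattyAcid:
    carbons: int
    double_bonds: tuple  # Δ positions counted from the carboxyl end, sorted

    def shorthand(self) -> str:
        # e.g. "18:2 Δ9,12"
        base = f"{self.carbons}:{len(self.double_bonds)}"
        if not self.double_bonds:
            return base
        return base + " Δ" + ",".join(str(p) for p in self.double_bonds)


def desaturate(fa: FattyAcid, position: int) -> FattyAcid:
    """Model a ΔN-desaturase: add a double bond between carbons N and N+1."""
    if not 1 < position < fa.carbons:
        raise ValueError("position outside the carbon chain")
    if position in fa.double_bonds:
        raise ValueError("double bond already present at this position")
    return FattyAcid(fa.carbons, tuple(sorted(fa.double_bonds + (position,))))


def elongate(fa: FattyAcid) -> FattyAcid:
    """Assumed two-carbon elongation at the carboxyl end (shifts Δ positions by 2)."""
    return FattyAcid(fa.carbons + 2, tuple(p + 2 for p in fa.double_bonds))


oleic = FattyAcid(18, (9,))
la = desaturate(oleic, 12)   # Δ12-desaturase
gla = desaturate(la, 6)      # Δ6-desaturase
dgla = elongate(gla)         # elongation step
ara = desaturate(dgla, 5)    # Δ5-desaturase

for name, fa in [("LA", la), ("GLA", gla), ("DGLA", dgla), ("ARA", ara)]:
    print(name, fa.shorthand())
```

The printed shorthand for each intermediate can be compared against the Δ notation given in the paragraph above.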

Relevant Literature 

Production of γ-linolenic acid by a Δ6-desaturase is described in USPN
5,552,306. Production of 8,11-eicosadienoic acid using Mortierella alpina
is disclosed in USPN 5,376,541. Production of docosahexaenoic acid by
dinoflagellates is described in USPN 5,407,957. Cloning of a Δ6-palmitoyl-
acyl carrier protein desaturase is described in PCT publication WO 96/13591
and USPN 5,614,400. Cloning of a Δ6-desaturase from borage is described in
PCT publication WO 96/21022. Cloning of Δ9-desaturases is described in the
published patent applications PCT WO 91/13972, EP 0 550 162 A1, EP 0 561
569 A2, EP 0 644 263 A2, and EP 0 736 598 A1, and in USPN 5,057,419.
Cloning of Δ12-desaturases from various organisms is described in PCT
publication WO 94/11516 and USPN 5,443,974. Cloning of Δ15-desaturases
from various organisms is described in PCT publication WO 93/11245. All
publications and U.S. patents or applications referred to herein are hereby
incorporated in their entirety by reference.

SUMMARY OF THE INVENTION 

Novel compositions and methods are provided for preparation of
poly-unsaturated long chain fatty acids. The compositions include nucleic
acid encoding a Δ6- and Δ12-desaturase and/or polypeptides having Δ6-
and/or Δ12-desaturase activity, the polypeptides, and probes for isolating
and detecting the same. The methods involve growing a host microorganism
or animal expressing an introduced gene or genes encoding at least one
desaturase, particularly a Δ6-, Δ9-, Δ12- or Δ15-desaturase. The methods
also involve the use of antisense constructs or gene disruptions to
decrease or eliminate the expression level of undesired desaturases.
Regulation of expression of the desaturase polypeptide(s) provides for a
relative increase in desired desaturated PUFAs as a result of altered
concentrations of enzymes and substrates involved in PUFA biosynthesis.
The invention finds use, for example, in the large scale production of
GLA, DGLA, ARA, EPA, DHA and SDA.

In a preferred embodiment of the invention, provided are an isolated
nucleic acid comprising a nucleotide sequence depicted in Figure 3A-E (SEQ
ID NO: 1) or Figure 5A-D (SEQ ID NO: 3), a polypeptide encoded by a
nucleotide sequence according to Figure 3A-E (SEQ ID NO: 1) or Figure 5A-D
(SEQ ID NO: 3), and a purified or isolated polypeptide comprising an amino
acid sequence depicted in Figure 3A-E (SEQ ID NO: 2) or Figure 5A-D (SEQ
ID NO: 4). In another embodiment of the invention, provided is an isolated
nucleic acid encoding a polypeptide having an amino acid sequence depicted
in Figure 3A-E (SEQ ID NO: 2) or Figure 5A-D (SEQ ID NO: 4).

Also provided is an isolated nucleic acid comprising a nucleotide
sequence which encodes a polypeptide which desaturates a fatty acid molecule
at carbon 6 or 12 from the carboxyl end, wherein said nucleotide sequence has
an average A/T content of less than about 60%. In a preferred embodiment, the
isolated nucleic acid is derived from a fungus, such as a fungus of the genus
Mortierella. More preferred is a fungus of the species Mortierella alpina.
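
The A/T content criterion above is a simple base-composition measure. As a minimal sketch, assuming A/T content means the percentage of A and T (or U) residues among the unambiguous bases of a sequence, it could be computed as follows. The function names and the 60% default are illustrative, and the test strings are toy inputs rather than any sequence from the figures.

```python
def at_content(seq: str) -> float:
    """Return the percentage of A/T (or U) among unambiguous bases in seq."""
    bases = [b for b in seq.upper() if b in "ACGTU"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    at = sum(1 for b in bases if b in "ATU")
    return 100.0 * at / len(bases)


def below_at_threshold(seq: str, limit: float = 60.0) -> bool:
    """True when the average A/T content is under the stated limit (assumed 60%)."""
    return at_content(seq) < limit


# Toy inputs only; these are not taken from SEQ ID NO: 1 or SEQ ID NO: 3.
print(at_content("ATGC"))          # half of the bases are A or T
print(below_at_threshold("ATGC"))
```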

In another preferred embodiment of the invention, an isolated nucleic
acid is provided wherein the nucleotide sequence of the nucleic acid is depicted
in Figure 3A-E (SEQ ID NO: 1) or Figure 5A-D (SEQ ID NO: 3). The
invention also provides an isolated or purified polypeptide which desaturates a
fatty acid molecule at carbon 6 or 12 from the carboxyl end, wherein the
polypeptide is a eukaryotic polypeptide or is derived from a eukaryotic
polypeptide, where a preferred eukaryotic polypeptide is derived from a fungus.

The present invention further includes a nucleic acid sequence which
hybridizes to Figure 3A-E (SEQ ID NO: 1) or Figure 5A-D (SEQ ID NO: 3).
Preferred is an isolated nucleic acid having a nucleotide sequence with at least
about 50% homology to Figure 3A-E (SEQ ID NO: 1) or Figure 5A-D (SEQ
ID NO: 3). The invention also includes an isolated nucleic acid having a
nucleotide sequence with at least about 50% homology to Figure 3A-E (SEQ
ID NO: 1) or Figure 5A-D (SEQ ID NO: 3). In a preferred embodiment, the
nucleic acid of the invention includes a nucleotide sequence which encodes an
amino acid sequence depicted in Figure 3A-D (SEQ ID NO: 2) which is
selected from the group consisting of amino acid residues 50-53, 39-43, 172-
176, 204-213, and 390-402.

Also provided by the present invention is a nucleic acid construct
comprising a nucleotide sequence depicted in a Figure 3A-E (SEQ ID NO: 1)
or Figure 5A-D (SEQ ID NO: 3) linked to a heterologous nucleic acid. In
another embodiment, a nucleic acid construct is provided which comprises a
nucleotide sequence depicted in a Figure 3A-E (SEQ ID NO: 1) or Figure
5A-D (SEQ ID NO: 3) operably associated with an expression control sequence
functional in a host cell. The host cell is either eukaryotic or prokaryotic.
Preferred eukaryotic host cells are those selected from the group consisting of a
mammalian cell, an insect cell, a fungal cell, and an algae cell. Preferred
mammalian cells include an avian cell, a preferred fungal cell includes a yeast
cell, and a preferred algae cell is a marine algae cell. Preferred prokaryotic cells
include those selected from the group consisting of a bacteria, a cyanobacteria,
cells which contain a bacteriophage, and/or a virus. The DNA sequence of the
recombinant host cell preferably contains a promoter which is functional in the
host cell, which promoter is preferably inducible. In a more preferred
embodiment, the microbial cell is a fungal cell of the genus Mortierella, with a
more preferred fungus being of the species Mortierella alpina.

In addition, the present invention provides a nucleic acid construct
comprising a nucleotide sequence which encodes a polypeptide comprising an
amino acid sequence which corresponds to or is complementary to an amino
acid sequence depicted in Figure 3A-E (SEQ ID NO: 2) or Figure 5A-D (SEQ
ID NO: 4), wherein the nucleic acid is operably associated with an expression
control sequence functional in a microbial cell, wherein the nucleotide sequence
encodes a functionally active polypeptide which desaturates a fatty acid
molecule at carbon 6 or carbon 12 from the carboxyl end of a fatty acid
molecule. Another embodiment of the present invention is a nucleic acid
construct comprising a nucleotide sequence which encodes a functionally
active Δ6-desaturase having an amino acid sequence which corresponds to or is
complementary to all of or a portion of an amino acid sequence depicted in a
Figure 3A-E (SEQ ID NO: 2), wherein the nucleotide sequence is operably
associated with a transcription control sequence functional in a host cell.

Yet another embodiment of the present invention is a nucleic acid
construct comprising a nucleotide sequence which encodes a functionally
active Δ12-desaturase having an amino acid sequence which corresponds to or
is complementary to all of or a portion of an amino acid sequence depicted in a
Figure 5A-D (SEQ ID NO: 4), wherein the nucleotide sequence is operably
associated with a transcription control sequence functional in a host cell. The
host cell is either a eukaryotic or prokaryotic host cell. Preferred eukaryotic
host cells are those selected from the group consisting of a mammalian cell, an
insect cell, a fungal cell, and an algae cell. Preferred mammalian cells include
an avian cell, a preferred fungal cell includes a yeast cell, and a preferred algae
cell is a marine algae cell. Preferred prokaryotic cells include those selected
from the group consisting of a bacteria, a cyanobacteria, cells which contain a
bacteriophage, and/or a virus. The DNA sequence of the recombinant host cell
preferably contains a promoter which is functional in the host cell and which
preferably is inducible. A preferred recombinant host cell is a microbial cell
such as a yeast cell, such as a Saccharomyces cell.

The present invention also provides a recombinant microbial cell
comprising at least one copy of a nucleic acid which encodes a functionally
active Mortierella alpina fatty acid desaturase having an amino acid sequence
as depicted in Figure 3A-E (SEQ ID NO: 2), wherein the cell or a parent of the
cell was transformed with a vector comprising said DNA sequence, and wherein
the DNA sequence is operably associated with an expression control sequence.
In a preferred embodiment, the cell is a microbial cell which is enriched in 18:2
fatty acids, particularly where the microbial cell is from a genus selected from
the group consisting of a prokaryotic cell and eukaryotic cell. In another
preferred embodiment, the microbial cell according to the invention includes an
expression control sequence which is endogenous to the microbial cell.

Also provided by the present invention is a method for production of
GLA in a host cell, where the method comprises growing a host culture having
a plurality of host cells which contain one or more nucleic acids encoding a
polypeptide which converts LA to GLA, wherein said one or more nucleic acids
is operably associated with an expression control sequence, under conditions
whereby said one or more nucleic acids are expressed, whereby GLA is
produced in the host cell. In several preferred embodiments of the methods, the
polypeptide employed in the method is a functionally active enzyme which
desaturates a fatty acid molecule at carbon 6 from the carboxyl end of a fatty
acid molecule; the said one or more nucleic acids is derived from a Mortierella
alpina; the substrate for the polypeptide is exogenously supplied; the host cells
are microbial cells; the microbial cells are yeast cells, such as Saccharomyces
cells; and the growing conditions are inducible.

Also provided is an oil comprising one or more PUFA, wherein the
amount of said one or more PUFAs is approximately 0.3-30% arachidonic acid
(ARA), approximately 0.2-30% dihomo-γ-linolenic acid (DGLA), and
approximately 0.2-30% γ-linolenic acid (GLA). A preferred oil of the invention
is one in which the ratio of ARA:DGLA:GLA is approximately 1.0:19.0:30 to
6.0:1.0:0.2. Another preferred embodiment of the invention is a pharmaceutical
composition comprising the oils in a pharmaceutically acceptable carrier.
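
The stated percentage ranges lend themselves to a quick check of a measured oil profile. The sketch below is illustrative only: the function names are hypothetical, the ranges are treated as inclusive bounds on percent of total fatty acids (an assumption, since the text says "approximately"), and the ratio helper simply normalizes to ARA = 1.0.

```python
# Ranges transcribed from the paragraph above, treated here as inclusive bounds.
RANGES = {
    "ARA": (0.3, 30.0),
    "DGLA": (0.2, 30.0),
    "GLA": (0.2, 30.0),
}


def in_stated_ranges(ara: float, dgla: float, gla: float) -> bool:
    """True when each percentage falls inside its stated range."""
    values = {"ARA": ara, "DGLA": dgla, "GLA": gla}
    return all(lo <= values[k] <= hi for k, (lo, hi) in RANGES.items())


def ratio_to_ara(ara: float, dgla: float, gla: float) -> tuple:
    """Express ARA:DGLA:GLA with ARA normalized to 1.0."""
    if ara <= 0:
        raise ValueError("ARA must be positive to normalize the ratio")
    return (1.0, dgla / ara, gla / ara)


# Hypothetical profile, in percent of total fatty acids.
print(in_stated_ranges(5.0, 2.0, 10.0))
print(ratio_to_ara(2.0, 38.0, 60.0))
```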

Further provided is a nutritional composition comprising the oils of the
invention. The nutritional compositions of the invention preferably are
administered to a mammalian host parenterally or internally. A preferred
composition of the invention for internal consumption is an infant formula. In a
preferred embodiment, the nutritional compositions of the invention are in a
liquid form or a solid form, and can be formulated in or as a dietary supplement,
and the oils provided in encapsulated form. The oils of the invention can be
free of particular components of other oils and can be derived from a microbial
cell, such as a yeast cell.

The present invention further provides a method for desaturating a fatty
acid. In a preferred embodiment the method comprises culturing a recombinant
microbial cell according to the invention under conditions suitable for
expression of a polypeptide encoded by said nucleic acid, wherein the host cell
further comprises a fatty acid substrate of said polypeptide. Also provided is a
fatty acid desaturated by such a method, and an oil composition comprising a
fatty acid produced according to the methods of the invention.

The present invention further includes a purified nucleotide sequence or
polypeptide sequence that is substantially related or homologous to the
nucleotide and peptide sequences presented in SEQ ID NO:1 - SEQ ID NO:40.
The present invention is further directed to methods of using the sequences
presented in SEQ ID NO:1 to SEQ ID NO:40 as probes to identify related
sequences, as components of expression systems and as components of systems
useful for producing transgenic oil.

The present invention is further directed to formulas, dietary
supplements or dietary supplements in the form of a liquid or a solid containing
the long chain fatty acids of the invention. These formulas and supplements
may be administered to a human or an animal.

The formulas and supplements of the invention may further comprise at
least one macronutrient selected from the group consisting of coconut oil, soy
oil, canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed
whey, electrodialysed skim milk, milk whey, soy protein, and other protein
hydrolysates.

The formulas of the present invention may further include at least one
vitamin selected from the group consisting of Vitamins A, C, D, E, and B
complex; and at least one mineral selected from the group consisting of
calcium, magnesium, zinc, manganese, sodium, potassium, phosphorus, copper,
chloride, iodine, selenium, and iron.

The present invention is further directed to a method of treating a patient
having a condition caused by insufficient intake or production of polyunsaturated
fatty acids comprising administering to the patient a dietary substitute of the
invention in an amount sufficient to effect treatment of the patient.

The present invention is further directed to cosmetic and pharmaceutical
compositions of the material of the invention.

The present invention is further directed to transgenic oils in
pharmaceutically acceptable carriers. The present invention is further directed
to nutritional supplements, cosmetic agents and infant formulae containing
transgenic oils.

The present invention is further directed to a method for obtaining
altered long chain polyunsaturated fatty acid biosynthesis comprising the steps
of: growing a microbe having cells which contain a transgene which encodes a
transgene expression product which desaturates a fatty acid molecule at carbon
6 or 12 from the carboxyl end of said fatty acid molecule, wherein the transgene
is operably associated with an expression control sequence, under conditions
whereby the transgene is expressed, whereby long chain polyunsaturated fatty
acid biosynthesis in the cells is altered.

The present invention is further directed toward pharmaceutical
compositions comprising at least one nutrient selected from the group consisting
of a vitamin, a mineral, a carbohydrate, a sugar, an amino acid, a free fatty acid,
a phospholipid, an antioxidant, and a phenolic compound.



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows possible pathways for the synthesis of arachidonic acid
(20:4 Δ5, 8, 11, 14) and stearidonic acid (18:4 Δ6, 9, 12, 15) from palmitic acid
(C16) from a variety of organisms, including algae, Mortierella and humans.
These PUFAs can serve as precursors to other molecules important for humans
and other animals, including prostacyclins, leukotrienes, and prostaglandins,
some of which are shown.

Figure 2 shows possible pathways for production of PUFAs in addition
to ARA, including EPA and DHA, again compiled from a variety of organisms.

Figure 3A-E shows the DNA sequence of the Mortierella alpina Δ6-
desaturase and the deduced amino acid sequence:

Figure 3A-E (SEQ ID NO 1 Δ6 DESATURASE cDNA)

Figure 3A-E (SEQ ID NO 2 Δ6 DESATURASE AMINO ACID)

Figure 4 shows an alignment of a portion of the Mortierella alpina Δ6-
desaturase amino acid sequence with other related sequences.

Figure 5A-D shows the DNA sequence of the Mortierella alpina Δ12-
desaturase and the deduced amino acid sequence:

Figure 5A-D (SEQ ID NO 3 Δ12 DESATURASE cDNA)

Figure 5A-D (SEQ ID NO 4 Δ12 DESATURASE AMINO ACID).

Figures 6A and 6B show the effect of different expression constructs on
expression of GLA in yeast.

Figures 7A and 7B show the effect of host strain on GLA production.

Figures 8A and 8B show the effect of temperature on GLA production in
S. cerevisiae strain SC334.

Figure 9 shows alignments of the protein sequence of the Ma 29 and
contig 253538a.

Figure 10 shows alignments of the protein sequence of Ma 524 and
contig 253538a.

BRIEF DESCRIPTION OF THE SEQUENCE LISTINGS 

SEQ ID NO:1 shows the DNA sequence of the Mortierella alpina Δ6-
desaturase.

SEQ ID NO:2 shows the protein sequence of the Mortierella alpina Δ6-
desaturase.

SEQ ID NO:3 shows the DNA sequence of the Mortierella alpina Δ12-
desaturase.

SEQ ID NO:4 shows the protein sequence of the Mortierella alpina
Δ12-desaturase.

SEQ ID NO:5-11 show various desaturase sequences.

SEQ ID NO:13-18 show various PCR primer sequences.

SEQ ID NO:19 and SEQ ID NO:20 show the nucleotide and amino acid
sequence of a Dictyostelium discoideum desaturase.

SEQ ID NO:21 and SEQ ID NO:22 show the nucleotide and amino acid
sequence of a Phaeodactylum tricornutum desaturase.

SEQ ID NO:23-26 show the nucleotide and deduced amino acid
sequence of a Schizochytrium cDNA clone.

SEQ ID NO:27-33 show nucleotide sequences for human desaturases.

SEQ ID NO:34 - SEQ ID NO:40 show peptide sequences for human
desaturases.

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In order to ensure a complete understanding of the invention, the 
following definitions are provided: 

Δ5-Desaturase: Δ5-desaturase is an enzyme which introduces a double
bond between carbons 5 and 6 from the carboxyl end of a fatty acid molecule.

Δ6-Desaturase: Δ6-desaturase is an enzyme which introduces a double
bond between carbons 6 and 7 from the carboxyl end of a fatty acid molecule.

Δ9-Desaturase: Δ9-desaturase is an enzyme which introduces a double
bond between carbons 9 and 10 from the carboxyl end of a fatty acid molecule.

Δ12-Desaturase: Δ12-desaturase is an enzyme which introduces a
double bond between carbons 12 and 13 from the carboxyl end of a fatty acid
molecule.

Fatty Acids: Fatty acids are a class of compounds containing a long
hydrocarbon chain and a terminal carboxylate group. Fatty acids include the
following:



Fatty Acid

12:0            lauric acid
16:0            palmitic acid
16:1            palmitoleic acid
18:0            stearic acid
18:1            oleic acid                    Δ9-18:1
18:2 Δ5,9       taxoleic acid                 Δ5,9-18:2
18:2 Δ6,9       6,9-octadecadienoic acid      Δ6,9-18:2
18:2            Linoleic acid                 Δ9,12-18:2 (LA)
18:3 Δ6,9,12    Gamma-linolenic acid          Δ6,9,12-18:3 (GLA)
18:3 Δ5,9,12    Pinolenic acid                Δ5,9,12-18:3
18:3            alpha-linolenic acid          Δ9,12,15-18:3 (ALA)
18:4            stearidonic acid              Δ6,9,12,15-18:4 (SDA)
20:0            Arachidic acid
20:1            Eicosenoic acid
22:0            behenic acid
22:1            erucic acid
22:2            docosadienoic acid
20:4 ω6         arachidonic acid              Δ5,8,11,14-20:4 (ARA)
20:3 ω6         ω6-eicosatrienoic             Δ8,11,14-20:3 (DGLA)
                (dihomo-gamma linolenic)
20:5 ω3         Eicosapentaenoic              Δ5,8,11,14,17-20:5 (EPA)
                (Timnodonic acid)
20:3 ω3         ω3-eicosatrienoic             Δ11,14,17-20:3
20:4 ω3         ω3-eicosatetraenoic           Δ8,11,14,17-20:4
22:5 ω3         Docosapentaenoic              Δ7,10,13,16,19-22:5 (ω3 DPA)
22:6 ω3         Docosahexaenoic               Δ4,7,10,13,16,19-22:6 (DHA)
                (cervonic acid)
24:0            Lignoceric acid
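
The Δ designations and the ω3/ω6 family labels in the table are two views of the same structure. As an illustrative sketch (the function names and the exact string format are assumptions made for this example), a designation such as "Δ5,8,11,14-20:4" can be parsed and its ω family derived by counting from the methyl end: ω equals the chain length minus the highest Δ position.

```python
import re

# Assumed format for this sketch: "Δ<positions>-<carbons>:<double bonds>"
_PATTERN = re.compile(r"^Δ(\d+(?:,\d+)*)-(\d+):(\d+)$")


def parse_delta(designation: str) -> tuple:
    """Return (carbons, number_of_double_bonds, positions) for a Δ designation."""
    match = _PATTERN.match(designation.replace(" ", ""))
    if match is None:
        raise ValueError(f"unrecognized designation: {designation!r}")
    positions = tuple(int(p) for p in match.group(1).split(","))
    carbons, n_bonds = int(match.group(2)), int(match.group(3))
    if len(positions) != n_bonds:
        raise ValueError("number of positions does not match double-bond count")
    return carbons, n_bonds, positions


def omega_class(carbons: int, positions: tuple) -> int:
    """Position of the last double bond counted from the methyl end."""
    return carbons - max(positions)


for label in ["Δ5,8,11,14-20:4", "Δ9,12,15-18:3", "Δ4,7,10,13,16,19-22:6"]:
    carbons, _, positions = parse_delta(label)
    print(label, "-> ω" + str(omega_class(carbons, positions)))
```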





Taking into account these definitions, the present invention is directed to
novel DNA sequences, DNA constructs, methods and compositions which
permit modification of the poly-unsaturated long chain fatty acid content
of, for example, microbial cells or animals. Host cells are manipulated to
express a sense or antisense transcript of a DNA encoding a polypeptide(s)
which catalyzes the desaturation of a fatty acid. The substrate(s) for the
expressed enzyme may be produced by the host cell or may be exogenously
supplied. To achieve expression, the transformed DNA is operably associated
with transcriptional and translational initiation and termination
regulatory regions that are functional in the host cell. Constructs
comprising the gene to be expressed can provide for integration into the
genome of the host cell or can autonomously replicate in the host cell. For
production of linoleic acid (LA), the expression cassettes generally used
include a cassette which provides for Δ12-desaturase activity, particularly
in a host cell which produces or can take up oleic acid (U.S. Patent No.
5,443,974). Production of LA also can be increased by providing an
expression cassette for a Δ9-desaturase where that enzymatic activity is
limiting. For production of ALA, the expression cassettes generally used
include a cassette which provides for Δ15- or ω3-desaturase activity,
particularly in a host cell which produces or can take up LA. For
production of GLA or SDA, the expression cassettes generally used include a
cassette which provides for Δ6-desaturase activity, particularly in a host
cell which produces or can take up LA or ALA, respectively. Production of
ω6-type unsaturated fatty acids, such as LA or GLA, is favored in a host
microorganism or animal which is incapable of producing ALA. The host ALA
production can be removed, reduced and/or inhibited by inhibiting the
activity of a Δ15- or ω3-type desaturase (see Figure 2). This can be
accomplished by standard selection, providing an expression cassette for an
antisense Δ15 or ω3 transcript, by disrupting a target Δ15- or
ω3-desaturase gene through insertion, deletion, substitution of part or all
of the target gene, or by adding an inhibitor of Δ15- or ω3-desaturase.
Similarly, production of LA or ALA is favored in a microorganism or animal
having Δ6-desaturase activity by providing an expression cassette for an
antisense Δ6 transcript, by disrupting a Δ6-desaturase gene, or by use of a
Δ6-desaturase inhibitor.
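
The product-by-product guidance in the preceding paragraph can be summarized as a lookup. The sketch below only restates that paragraph in tabular form; the dictionary layout and the `plan` helper are hypothetical conveniences, not part of the disclosure.

```python
# Desired product -> cassette activity and the substrate the host must
# produce or take up, as described in the paragraph above.
CASSETTES = {
    "LA": {"activity": "Δ12-desaturase", "substrate": "oleic acid"},
    "ALA": {"activity": "Δ15- or ω3-desaturase", "substrate": "LA"},
    "GLA": {"activity": "Δ6-desaturase", "substrate": "LA"},
    "SDA": {"activity": "Δ6-desaturase", "substrate": "ALA"},
}

# Competing activities whose reduction (antisense, gene disruption, or
# inhibitor) is described as favoring the product.
SUPPRESS = {
    "LA": ["Δ15- or ω3-desaturase", "Δ6-desaturase"],
    "ALA": ["Δ6-desaturase"],
    "GLA": ["Δ15- or ω3-desaturase"],
    "SDA": [],
}


def plan(product: str) -> dict:
    """Return the cassette activity, substrate, and activities to suppress."""
    if product not in CASSETTES:
        raise ValueError(f"no guidance recorded for product {product!r}")
    entry = dict(CASSETTES[product])
    entry["suppress"] = list(SUPPRESS[product])
    return entry


print(plan("GLA"))
```

Keeping the suppression targets in a separate table mirrors the text, which treats providing an activity and removing a competing one as independent choices.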

MICROBIAL PRODUCTION OF FATTY ACIDS 

Microbial production of fatty acids has several advantages over
purification from natural sources such as fish or plants. Many microbes are
known with greatly simplified oil compositions compared with those of higher
organisms, making purification of desired components easier. Microbial
production is not subject to fluctuations caused by external variables such as
weather and food supply. Microbially produced oil is substantially free of
contamination by environmental pollutants. Additionally, microbes can provide
PUFAs in particular forms which may have specific uses. For example,
Spirulina can provide PUFAs predominantly at the first and third positions of
triglycerides; digestion by pancreatic lipases preferentially releases fatty acids
from these positions. Following human or animal ingestion of triglycerides
derived from Spirulina, these PUFAs are released by pancreatic lipases as free
fatty acids and thus are directly available, for example, for infant brain
development. Additionally, microbial oil production can be manipulated by
controlling culture conditions, notably by providing particular substrates for
microbially expressed enzymes, or by addition of compounds which suppress
undesired biochemical pathways. In addition to these advantages, production of
fatty acids from recombinant microbes provides the ability to alter the naturally
occurring microbial fatty acid profile by providing new synthetic pathways in
the host or by suppressing undesired pathways, thereby increasing levels of
desired PUFAs, or conjugated forms thereof, and decreasing levels of undesired
PUFAs.

PRODUCTION OF FATTY ACIDS IN ANIMALS 

Production of fatty acids in animals also presents several advantages.
Expression of desaturase genes in animals can produce greatly increased levels
of desired PUFAs in animal tissues, making recovery from those tissues more
economical. For example, where the desired PUFAs are expressed in the breast
milk of animals, methods of isolating PUFAs from animal milk are well
established. In addition to providing a source for purification of desired
PUFAs, animal breast milk can be manipulated through expression of
desaturase genes, either alone or in combination with other human genes, to
provide animal milks substantially similar to human breast milk during the
different stages of infant development. Humanized animal milks could serve as
infant formulas where human nursing is impossible or undesired, or in cases of
malnourishment or disease.

Depending upon the host cell, the availability of substrate, and the
desired end product(s), several polypeptides, particularly desaturases, are of
interest. By "desaturase" is intended a polypeptide which can desaturate one or
more fatty acids to produce a mono- or poly-unsaturated fatty acid or precursor
thereof of interest. Of particular interest are polypeptides which can catalyze
the conversion of stearic acid to oleic acid, of oleic acid to LA, of LA to ALA,
of LA to GLA, or of ALA to SDA, which includes enzymes which desaturate at
the Δ9, Δ12 (ω6), Δ15 (ω3) or Δ6 positions. By "polypeptide" is meant any
chain of amino acids, regardless of length or post-translational modification, for
example, glycosylation or phosphorylation. Considerations for choosing a
specific polypeptide having desaturase activity include the pH optimum of the
polypeptide, whether the polypeptide is a rate limiting enzyme or a component
thereof, whether the desaturase used is essential for synthesis of a desired
poly-unsaturated fatty acid, and/or co-factors required by the polypeptide. The
expressed polypeptide preferably has parameters compatible with the
biochemical environment of its location in the host cell. For example, the
polypeptide may have to compete for substrate with other enzymes in the host
cell. Analyses of the Km and specific activity of the polypeptide in question
therefore are considered in determining the suitability of a given polypeptide for
modifying PUFA production in a given host cell. The polypeptide used in a
particular situation is one which can function under the conditions present in the
intended host cell but otherwise can be any polypeptide having desaturase
activity which has the desired characteristic of being capable of modifying the
relative production of a desired PUFA.

For production of linoleic acid from oleic acid, the DNA sequence used
encodes a polypeptide having Δ12-desaturase activity. For production of GLA
from linoleic acid, the DNA sequence used encodes a polypeptide having
Δ6-desaturase activity. In particular instances, expression of Δ6-desaturase activity
can be coupled with expression of Δ12-desaturase activity and the host cell can
optionally be depleted of any Δ15-desaturase activity present, for example by
providing a transcription cassette for production of antisense sequences to the
Δ15-desaturase transcription product, by disrupting the Δ15-desaturase gene, or
by using a host cell which naturally has, or has been mutated to have, low
Δ15-desaturase activity. Inhibition of undesired desaturase pathways also can be
accomplished through the use of specific desaturase inhibitors such as those
described in U.S. Patent No. 4,778,630. Also, a host cell for Δ6-desaturase
expression may have, or have been mutated to have, high Δ12-desaturase
activity. The choice of combination of cassettes used depends in part on the
PUFA profile and/or desaturase profile of the host cell. Where the host cell
expresses Δ12-desaturase activity and lacks or is depleted in Δ15-desaturase
activity, overexpression of Δ6-desaturase alone generally is sufficient to provide
for enhanced GLA production. Where the host cell expresses Δ9-desaturase
activity, expression of a Δ12- and a Δ6-desaturase can provide for enhanced
GLA production. When Δ9-desaturase activity is absent or limiting, an
expression cassette for Δ9-desaturase can be used. A scheme for the synthesis
of arachidonic acid (20:4 Δ5,8,11,14) from stearic acid (18:0) is shown in Figure
2. A key enzyme in this pathway is a Δ6-desaturase which converts linoleic
acid into γ-linolenic acid. Conversion of α-linolenic acid (ALA) to stearidonic
acid by a Δ6-desaturase also is shown.
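
The choice of cassettes described above can be sketched as a simple decision rule. The following Python fragment is illustrative only: the function name, the cassette labels, and the reduction of each host activity to a present/absent flag are assumptions made for the example, whereas an actual host profile is quantitative and would be determined experimentally.

```python
def cassettes_for_gla(has_d9, has_d12, has_d15):
    """Suggest cassettes for enhanced GLA production in a host cell.

    Each argument is a simplified yes/no flag for whether the host already
    has adequate activity of the named desaturase (hypothetical model).
    """
    # A delta-6 desaturase is always needed to convert LA to GLA.
    cassettes = ["delta6-desaturase"]
    if not has_d12:
        # No (or limiting) conversion of oleic acid to LA in the host.
        cassettes.append("delta12-desaturase")
    if not has_d9:
        # No (or limiting) conversion of stearic acid to oleic acid.
        cassettes.append("delta9-desaturase")
    if has_d15:
        # Competing conversion of LA to ALA; deplete it, e.g. by antisense.
        cassettes.append("antisense-delta15")
    return cassettes
```

For a host which expresses Δ9- and Δ12-desaturase activity and lacks Δ15-desaturase activity, the sketch returns only the Δ6-desaturase cassette, in line with the statement that overexpression of Δ6-desaturase alone generally is sufficient in such a host.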

SOURCES OF POLYPEPTIDES 
HAVING DESATURASE ACTIVITY 

A source of polypeptides having desaturase activity and oligonucleotides
encoding such polypeptides are organisms which produce a desired
poly-unsaturated fatty acid. As an example, microorganisms having an ability to
produce GLA or ARA can be used as a source of Δ6- or Δ12-desaturase
activity. Such microorganisms include, for example, those belonging to the
genera Mortierella, Conidiobolus, Pythium, Phytophthora, Penicillium,
Porphyridium, Coidosporium, Mucor, Fusarium, Aspergillus, Rhodotorula, and
Entomophthora. Within the genus Porphyridium, of particular interest is
Porphyridium cruentum. Within the genus Mortierella, of particular interest are
Mortierella elongata, Mortierella exigua, Mortierella hygrophila, Mortierella
ramanniana, var. angulispora, and Mortierella alpina. Within the genus Mucor,
of particular interest are Mucor circinelloides and Mucor javanicus.

DNAs encoding desired desaturases can be identified in a variety of
ways. As an example, a source of the desired desaturase, for example genomic
or cDNA libraries from Mortierella, is screened with detectable enzymatically-
or chemically-synthesized probes, which can be made from DNA, RNA, or
non-naturally occurring nucleotides, or mixtures thereof. Probes may be
enzymatically synthesized from DNAs of known desaturases for normal or
reduced-stringency hybridization methods. Oligonucleotide probes also can be
used to screen sources and can be based on sequences of known desaturases,
including sequences conserved among known desaturases, or on peptide
sequences obtained from the desired purified protein. Oligonucleotide probes
based on amino acid sequences can be degenerate to encompass the degeneracy
of the genetic code, or can be biased in favor of the preferred codons of the
source organism. Oligonucleotides also can be used as primers for PCR from
reverse transcribed mRNA from a known or suspected source; the PCR product
can be the full length cDNA or can be used to generate a probe to obtain the
desired full length cDNA. Alternatively, a desired protein can be entirely
sequenced and total synthesis of a DNA encoding that polypeptide performed.
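
The degeneracy of an oligonucleotide probe based on an amino acid sequence can be estimated by multiplying the number of codons which encode each residue in the standard genetic code. The following Python sketch is offered only as an illustration; the function names and the peptide used in the usage note are hypothetical and are not taken from any disclosed desaturase sequence.

```python
from math import prod

# Number of codons encoding each amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def probe_degeneracy(peptide):
    """Count the distinct DNA sequences which could encode the peptide."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide.upper())


def least_degenerate_window(peptide, width):
    """Return (start, window, degeneracy) for the least degenerate window.

    Returns None if the peptide is shorter than the requested width.
    """
    best = None
    for start in range(len(peptide) - width + 1):
        window = peptide[start:start + width]
        degeneracy = probe_degeneracy(window)
        if best is None or degeneracy < best[2]:
            best = (start, window, degeneracy)
    return best
```

For the hypothetical peptide LSRMWHDQ, a five-residue window over MWHDQ requires a mixture of only 8 oligonucleotides, whereas a window over LSRMW would require 216. Regions rich in methionine and tryptophan therefore are generally preferred when designing degenerate probes or primers.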

Once the desired genomic DNA or cDNA has been isolated, it can be
sequenced by known methods. It is recognized in the art that such methods are
subject to errors, such that multiple sequencing of the same region is routine and
is still expected to lead to measurable rates of mistakes in the resulting deduced
sequence, particularly in regions having repeated domains, extensive secondary
structure, or unusual base compositions, such as regions with high GC base
content. When discrepancies arise, resequencing can be done and can employ
special methods. Special methods can include altering sequencing conditions
by using: different temperatures; different enzymes; proteins which alter the
ability of oligonucleotides to form higher order structures; altered nucleotides
such as ITP or methylated dGTP; different gel compositions, for example
adding formamide; different primers or primers located at different distances
from the problem region; or different templates such as single stranded DNAs.
Sequencing of mRNA also can be employed.

For the most part, some or all of the coding sequence for the polypeptide
having desaturase activity is from a natural source. In some situations,
however, it is desirable to modify all or a portion of the codons, for example, to
enhance expression, by employing host preferred codons. Host preferred
codons can be determined from the codons of highest frequency in the proteins
expressed in the largest amount in a particular host species of interest. Thus, the
coding sequence for a polypeptide having desaturase activity can be
synthesized in whole or in part. All or portions of the DNA also can be
synthesized to remove any destabilizing sequences or regions of secondary
structure which would be present in the transcribed mRNA. All or portions of
the DNA also can be synthesized to alter the base composition to one more
preferable in the desired host cell. Methods for synthesizing sequences and
bringing sequences together are well established in the literature. In vitro
mutagenesis and selection, site-directed mutagenesis, or other means can be
employed to obtain mutations of naturally occurring desaturase genes to
produce a polypeptide having desaturase activity in vivo with more desirable
physical and kinetic parameters for function in the host cell, such as a longer
half-life or a higher rate of production of a desired polyunsaturated fatty acid.
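
The determination of host preferred codons from highly expressed genes can be sketched as a simple tally. The Python fragment below is a minimal illustration assuming the standard genetic code and in-frame coding sequences supplied as strings; the function name and the short input sequence in the usage note are hypothetical, and a practical analysis would use many highly expressed genes from the host of interest.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def preferred_codons(coding_seqs):
    """Map each amino acid to its most frequent codon in the given sequences.

    coding_seqs: iterable of in-frame DNA coding sequences, ideally taken
    from the proteins expressed in the largest amount in the host.
    """
    counts = defaultdict(Counter)
    for seq in coding_seqs:
        seq = seq.upper()
        # Walk complete codons only; ignore any trailing partial codon.
        for i in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[i:i + 3]
            aa = CODON_TABLE.get(codon)  # None for ambiguous bases
            if aa and aa != "*":
                counts[aa][codon] += 1
    return {aa: tally.most_common(1)[0][0] for aa, tally in counts.items()}
```

For the toy input ATGGCTGCTGCCAAATAA, alanine is encoded twice by GCT and once by GCC, so GCT is reported as the preferred alanine codon. The resulting table can then guide synthesis of all or part of a desaturase coding sequence for the intended host.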

Mortierella alpina Desaturase

Of particular interest is the Mortierella alpina Δ6-desaturase, which has
457 amino acids and a predicted molecular weight of 51.8 kD; the amino acid
sequence is shown in Figure 3. The gene encoding the Mortierella alpina
Δ6-desaturase can be expressed in transgenic microorganisms or animals to effect
greater synthesis of GLA from linoleic acid or of stearidonic acid from ALA.
Other DNAs which are substantially identical to the Mortierella alpina
Δ6-desaturase DNA, or which encode polypeptides which are substantially identical
to the Mortierella alpina Δ6-desaturase polypeptide, also can be used. By
substantially identical is intended an amino acid sequence or nucleic acid
sequence exhibiting in order of increasing preference at least 60%, 80%, 90% or
95% homology to the Mortierella alpina Δ6-desaturase amino acid sequence or
nucleic acid sequence encoding the amino acid sequence. For polypeptides, the
length of comparison sequences generally is at least 16 amino acids, preferably
at least 20 amino acids, or most preferably 35 amino acids. For nucleic acids,
the length of comparison sequences generally is at least 50 nucleotides,
preferably at least 60 nucleotides, and more preferably at least 75 nucleotides,
and most preferably, 110 nucleotides. Homology typically is measured using
sequence analysis software, for example, the Sequence Analysis software
package of the Genetics Computer Group, University of Wisconsin
Biotechnology Center, 1710 University Avenue, Madison, Wisconsin 53705,
MEGAlign (DNAStar, Inc., 1228 S. Park St., Madison, Wisconsin 53715), and
MacVector (Oxford Molecular Group, 2105 S. Bascom Avenue, Suite 200,
Campbell, California 95008). Such software matches similar sequences by
assigning degrees of homology to various substitutions, deletions, and other
modifications. Conservative substitutions typically include substitutions within
the following groups: glycine and alanine; valine, isoleucine and leucine;
aspartic acid, glutamic acid, asparagine, and glutamine; serine and threonine;
lysine and arginine; and phenylalanine and tyrosine. Substitutions may also be
made on the basis of conserved hydrophobicity or hydrophilicity (Kyte and
Doolittle, J. Mol. Biol. 157: 105-132, 1982), or on the basis of the ability to
assume similar polypeptide secondary structure (Chou and Fasman, Adv.
Enzymol. 47: 45-148, 1978).
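
As a simplified illustration of how homology between two polypeptide segments may be scored, the following Python sketch computes percent identity and a percent similarity which counts substitutions within the conservative groups listed above as matches. It assumes the two segments are already aligned, ungapped, and of equal length; the sequence analysis packages named above additionally perform the alignment and score insertions and deletions, which this sketch does not attempt. The function names and example sequences are hypothetical.

```python
# Conservative substitution groups as listed in the text (one-letter codes):
# Gly/Ala; Val/Ile/Leu; Asp/Glu/Asn/Gln; Ser/Thr; Lys/Arg; Phe/Tyr.
CONSERVATIVE_GROUPS = ["GA", "VIL", "DENQ", "ST", "KR", "FY"]


def _group(residue):
    """Return the conservative group of a residue, or the residue itself."""
    for group in CONSERVATIVE_GROUPS:
        if residue in group:
            return group
    return residue


def _check(a, b):
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")


def percent_identity(a, b):
    """Percentage of aligned positions with identical residues."""
    _check(a, b)
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


def percent_similarity(a, b):
    """Percentage of positions that are identical or conservatively substituted."""
    _check(a, b)
    return 100.0 * sum(_group(x) == _group(y) for x, y in zip(a, b)) / len(a)
```

For the toy segments GVKDSF and AVRDTW, two of six positions are identical and five of six are identical or conservatively substituted. In practice the comparison would be made over at least the minimum lengths stated above, for example at least 16 amino acids.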

Also of interest is the Mortierella alpina Δ12-desaturase, the nucleotide
and amino acid sequence of which is shown in Figure 5. The gene encoding the
Mortierella alpina Δ12-desaturase can be expressed in transgenic
microorganisms or animals to effect greater synthesis of LA from oleic acid.
Other DNAs which are substantially identical to the Mortierella alpina
Δ12-desaturase DNA, or which encode polypeptides which are substantially identical
to the Mortierella alpina Δ12-desaturase polypeptide, also can be used.

Other Desaturases

Encompassed by the present invention are related desaturases from the
same or other organisms. Such related desaturases include variants of the
disclosed Δ6- or Δ12-desaturase naturally occurring within the same or different
species of Mortierella, as well as homologues of the disclosed Δ6- or
Δ12-desaturase from other species. Also included are desaturases which, although
not substantially identical to the Mortierella alpina Δ6- or Δ12-desaturase,
desaturate a fatty acid molecule at carbon 6 or 12, respectively, from the
carboxyl end of a fatty acid molecule, or at carbon 12 or 6 from the terminal
methyl carbon in an 18 carbon fatty acid molecule. Related desaturases can be
identified by their ability to function substantially the same as the disclosed
desaturases; that is, they are still able to effectively convert LA to GLA, ALA to
SDA or oleic acid to LA. Related desaturases also can be identified by
screening sequence databases for sequences homologous to the disclosed
desaturases, by hybridization of a probe based on the disclosed desaturases to a
library constructed from the source organism, or by RT-PCR using mRNA from
the source organism and primers based on the disclosed desaturases. Such
desaturases include those from humans, Dictyostelium discoideum and
Phaeodactylum tricornutum.
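
The relationship between the two numbering conventions used above (counting from the carboxyl end, denoted Δ, and counting from the terminal methyl carbon, denoted ω) is a subtraction from the chain length. A minimal Python sketch follows; the function name is an assumption made for the example.

```python
def delta_to_omega(delta, chain_length=18):
    """Convert a delta position (from the carboxyl end) to an omega position.

    The omega position counts from the terminal methyl carbon, so for a
    fatty acid of chain_length carbons, omega = chain_length - delta.
    """
    if not 1 <= delta < chain_length:
        raise ValueError("delta must lie within the carbon chain")
    return chain_length - delta
```

For an 18 carbon fatty acid, the Δ6 position corresponds to carbon 12 from the methyl end, the Δ12 position to ω6, and the Δ15 position to ω3, consistent with the positions recited above.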

The regions of a desaturase polypeptide important for desaturase activity
can be determined through routine mutagenesis, expression of the resulting
mutant polypeptides and determination of their activities. Mutants may include
deletions, insertions and point mutations, or combinations thereof. A typical
functional analysis begins with deletion mutagenesis to determine the N- and
C-terminal limits of the protein necessary for function, and then internal deletions,
insertions or point mutants are made to further determine regions necessary for
function. Other techniques such as cassette mutagenesis or total synthesis also
can be used. Deletion mutagenesis is accomplished, for example, by using
exonucleases to sequentially remove the 5' or 3' coding regions. Kits are
available for such techniques. After deletion, the coding region is completed by
ligating oligonucleotides containing start or stop codons to the deleted coding
region after 5' or 3' deletion, respectively. Alternatively, oligonucleotides
encoding start or stop codons are inserted into the coding region by a variety of
methods including site-directed mutagenesis, mutagenic PCR or by ligation
onto DNA digested at existing restriction sites. Internal deletions can similarly
be made through a variety of methods including the use of existing restriction
sites in the DNA, by use of mutagenic primers via site-directed mutagenesis or
mutagenic PCR. Insertions are made through methods such as linker-scanning
mutagenesis, site-directed mutagenesis or mutagenic PCR. Point mutations are
made through techniques such as site-directed mutagenesis or mutagenic PCR.

Chemical mutagenesis also can be used for identifying regions of a
desaturase polypeptide important for activity. A mutated construct is expressed,
and the ability of the resulting altered protein to function as a desaturase is
assayed. Such structure-function analysis can determine which regions may be
deleted, which regions tolerate insertions, and which point mutations allow the
mutant protein to function in substantially the same way as the native
desaturase. All such mutant proteins and nucleotide sequences encoding them
are within the scope of the present invention.

EXPRESSION OF DESATURASE GENES 

Once the DNA encoding a desaturase polypeptide has been obtained, it
is placed in a vector capable of replication in a host cell, or is propagated in
vitro by means of techniques such as PCR or long PCR. Replicating vectors
can include plasmids, phage, viruses, cosmids and the like. Desirable vectors
include those useful for mutagenesis of the gene of interest or for expression of
the gene of interest in host cells. The technique of long PCR has made in vitro
propagation of large constructs possible, so that modifications to the gene of
interest, such as mutagenesis or addition of expression signals, and propagation
of the resulting constructs can occur entirely in vitro without the use of a
replicating vector or a host cell.

For expression of a desaturase polypeptide, functional transcriptional
and translational initiation and termination regions are operably linked to the
DNA encoding the desaturase polypeptide. Expression of the polypeptide
coding region can take place in vitro or in a host cell. Transcriptional and
translational initiation and termination regions are derived from a variety of
nonexclusive sources, including the DNA to be expressed, genes known or
suspected to be capable of expression in the desired system, expression vectors,
chemical synthesis, or from an endogenous locus in a host cell.

Expression In Vitro

In vitro expression can be accomplished, for example, by placing the 
coding region for the desaturase polypeptide in an expression vector designed 
for in vitro use and adding rabbit reticulocyte lysate and cofactors; labeled 
amino acids can be incorporated if desired. Such in vitro expression vectors 
may provide some or all of the expression signals necessary in the system used. 
These methods are well known in the art and the components of the system are 
commercially available. The reaction mixture can then be assayed directly for 
the polypeptide, for example by determining its activity, or the synthesized 
polypeptide can be purified and then assayed. 

Expression In A Host Cell 

Expression in a host cell can be accomplished in a transient or stable 
fashion. Transient expression can occur from introduced constructs which 
contain expression signals functional in the host cell, but which constructs do 
not replicate and rarely integrate in the host cell, or where the host cell is not 
proliferating. Transient expression also can be accomplished by inducing the 
activity of a regulatable promoter operably linked to the gene of interest, 
although such inducible systems frequently exhibit a low basal level of 
expression. Stable expression can be achieved by introduction of a construct 
that can integrate into the host genome or that autonomously replicates in the 
host cell. Stable expression of the gene of interest can be selected for through 
the use of a selectable marker located on or transfected with the expression 
construct, followed by selection for cells expressing the marker. When stable 
expression results from integration, integration of constructs can occur 
randomly within the host genome or can be targeted through the use of 
constructs containing regions of homology with the host genome sufficient to 
target recombination with the host locus. Where constructs are targeted to an 
endogenous locus, all or some of the transcriptional and translational regulatory 
regions can be provided by the endogenous locus.

When increased expression of the desaturase polypeptide in the source
organism is desired, several methods can be employed. Additional genes
encoding the desaturase polypeptide can be introduced into the host organism.
Expression from the native desaturase locus also can be increased through
homologous recombination, for example by inserting a stronger promoter into
the host genome to cause increased expression, by removing destabilizing
sequences from either the mRNA or the encoded protein by deleting that
information from the host genome, or by adding stabilizing sequences to the
mRNA (USPN 4,910,141).

When it is desirable to express more than one different gene, using appropriate
regulatory regions and expression methods, the introduced genes can be propagated
in the host cell through use of replicating vectors or by integration into the host
genome. Where two or more genes are expressed from separate replicating
vectors, it is desirable that each vector has a different means of replication.
Each introduced construct, whether integrated or not, should have a different
means of selection and should lack homology to the other constructs to maintain
stable expression and prevent reassortment of elements among constructs.
Judicious choices of regulatory regions, selection means and method of
propagation of the introduced construct can be experimentally determined so
that all introduced genes are expressed at the necessary levels to provide for
synthesis of the desired products.

As an example, where the host cell is a yeast, transcriptional and
translational regions functional in yeast cells are provided, particularly from the
host species. The transcriptional initiation regulatory regions can be obtained,
for example from genes in the glycolytic pathway, such as alcohol
dehydrogenase, glyceraldehyde-3-phosphate dehydrogenase (GPD),
phosphoglucoisomerase, phosphoglycerate kinase, etc. or regulatable genes
such as acid phosphatase, lactase, metallothionein, glucoamylase, etc. Any one
of a number of regulatory sequences can be used in a particular situation,
depending upon whether constitutive or induced transcription is desired, the
particular efficiency of the promoter in conjunction with the open-reading frame
of interest, the ability to join a strong promoter with a control region from a
different promoter which allows for inducible transcription, ease of
construction, and the like. Of particular interest are promoters which are
activated in the presence of galactose. Galactose-inducible promoters (GAL1,
GAL7, and GAL10) have been extensively utilized for high level and regulated
expression of protein in yeast (Lue et al., Mol. Cell. Biol. Vol. 7, p. 3446, 1987;
Johnston, Microbiol. Rev. Vol. 51, p. 458, 1987). Transcription from the GAL
promoters is activated by the GAL4 protein, which binds to the promoter region
and activates transcription when galactose is present. In the absence of
galactose, the antagonist GAL80 binds to GAL4 and prevents GAL4 from
activating transcription. Addition of galactose prevents GAL80 from inhibiting
activation by GAL4.
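
The regulatory logic just described can be summarized as a Boolean rule. The Python sketch below is a deliberately simplified model covering only the GAL4, GAL80 and galactose interactions stated above; the function and parameter names are assumptions, and other influences on GAL promoter activity are not represented.

```python
def gal_promoter_active(galactose_present, gal4_present=True, gal80_present=True):
    """Simplified on/off model of transcription from a GAL promoter.

    GAL4 is required for activation. GAL80 blocks GAL4 unless galactose is
    present, in which case GAL80 no longer inhibits activation by GAL4.
    """
    if not gal4_present:
        return False
    gal80_inhibiting = gal80_present and not galactose_present
    return not gal80_inhibiting
```

Under this model a wild-type host transcribes a desaturase gene placed behind a GAL promoter only after galactose is added, which is the basis for regulated expression from vectors carrying such promoters.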

Nucleotide sequences surrounding the translational initiation codon
ATG have been found to affect expression in yeast cells. If the desired
polypeptide is poorly expressed in yeast, the nucleotide sequences of exogenous
genes can be modified to include an efficient yeast translation initiation
sequence to obtain optimal gene expression. For expression in Saccharomyces,
this can be done by site-directed mutagenesis of an inefficiently expressed gene
by fusing it in-frame to an endogenous Saccharomyces gene, preferably a highly
expressed gene, such as the lactase gene.

The termination region can be derived from the 3' region of the gene
from which the initiation region was obtained or from a different gene. A large
number of termination regions are known and have been found to be
satisfactory in a variety of hosts from the same and different genera and species.
The termination region usually is selected more as a matter of convenience
than because of any particular property. Preferably, the termination
region is derived from a yeast gene, particularly Saccharomyces,
Schizosaccharomyces, Candida or Kluyveromyces. The 3' regions of two
mammalian genes, γ interferon and α2 interferon, are also known to function in
yeast.

INTRODUCTION OF CONSTRUCTS INTO HOST CELLS

Constructs comprising the gene of interest may be introduced into a host
cell by standard techniques. These techniques include transformation,
protoplast fusion, lipofection, transfection, transduction, conjugation, infection,
biolistic impact, electroporation, microinjection, scraping, or any other method
which introduces the gene of interest into the host cell. Methods of
transformation which are used include lithium acetate transformation (Methods
in Enzymology, Vol. 194, p. 186-187, 1991). For convenience, a host cell which
has been manipulated by any method to take up a DNA sequence or construct
will be referred to as "transformed" or "recombinant" herein.

The subject host will have at least one copy of the expression
construct and may have two or more, depending upon whether the gene is
integrated into the genome, amplified, or is present on an extrachromosomal
element having multiple copy numbers. Where the subject host is a yeast, four
principal types of yeast plasmid vectors can be used: Yeast Integrating plasmids
(YIps), Yeast Replicating plasmids (YRps), Yeast Centromere plasmids
(YCps), and Yeast Episomal plasmids (YEps). YIps lack a yeast replication
origin and must be propagated as integrated elements in the yeast genome.
YRps have a chromosomally derived autonomously replicating sequence and
are propagated as medium copy number (20 to 40), autonomously replicating,
unstably segregating plasmids. YCps have both a replication origin and a
centromere sequence and propagate as low copy number (10-20), autonomously
replicating, stably segregating plasmids. YEps have an origin of replication
from the yeast 2μm plasmid and are propagated as high copy number,
autonomously replicating, irregularly segregating plasmids. The presence of the
plasmids in yeast can be ensured by maintaining selection for a marker on the
plasmid. Of particular interest are the yeast vectors pYES2 (a YEp plasmid
available from Invitrogen, conferring uracil prototrophy and having a GAL1
galactose-inducible promoter for expression), pRS425-pG1 (a YEp plasmid obtained from
Dr. T. H. Chang, Ass. Professor of Molecular Genetics, Ohio State University,
containing a constitutive GPD promoter and conferring leucine prototrophy),
and pYX424 (a YEp plasmid having a constitutive TPI promoter and conferring
leucine prototrophy; Alber, T. and Kawasaki, G. (1982). J. Mol. & Appl.
Genetics 1: 419).

The transformed host cell can be identified by selection for a marker
contained on the introduced construct. Alternatively, a separate marker
construct may be introduced with the desired construct, as many transformation
techniques introduce many DNA molecules into host cells. Typically,
transformed hosts are selected for their ability to grow on selective media.
Selective media may incorporate an antibiotic or lack a factor necessary for
growth of the untransformed host, such as a nutrient or growth factor. An
introduced marker gene therefore may confer antibiotic resistance, or encode an
essential growth factor or enzyme, and permit growth on selective media when
expressed in the transformed host. Selection of a transformed host can also
occur when the expressed marker protein can be detected, either directly or
indirectly. The marker protein may be expressed alone or as a fusion to another
protein. The marker protein can be detected by its enzymatic activity; for
example β-galactosidase can convert the substrate X-gal to a colored product,
and luciferase can convert luciferin to a light-emitting product. The marker
protein can be detected by its light-producing or modifying characteristics; for
example, the green fluorescent protein of Aequorea victoria fluoresces when
illuminated with blue light. Antibodies can be used to detect the marker
protein or a molecular tag on, for example, a protein of interest. Cells
expressing the marker protein or tag can be selected, for example, visually, or
by techniques such as FACS or panning using antibodies. For selection of yeast
transformants, any marker that functions in yeast may be used. Desirably,
resistance to kanamycin and the aminoglycoside G418 are of interest, as well as
ability to grow on media lacking uracil, leucine, lysine or tryptophan.

Of particular interest is the Δ6- and Δ12-desaturase-mediated production
of PUFAs in prokaryotic and eukaryotic host cells. Prokaryotic cells of interest
include Escherichia, Bacillus, Lactobacillus, cyanobacteria and the like.
Eukaryotic cells include mammalian cells such as those of lactating animals,
avian cells such as of chickens, and other cells amenable to genetic
manipulation including insect, fungal, and algae cells. The cells may be
cultured or formed as part or all of a host organism including an animal.
Viruses and bacteriophage also may be used with the cells in the production of
PUFAs, particularly for gene transfer, cellular targeting and selection. In a
preferred embodiment, the host is any microorganism or animal which produces
and/or can assimilate exogenously supplied substrate(s) for a Δ6- and/or
Δ12-desaturase, and preferably produces large amounts of one or more of the
substrates. Examples of host animals include mice, rats, rabbits, chickens, quail,
turkeys, bovines, sheep, pigs, goats, yaks, etc., which are amenable to genetic
manipulation and cloning for rapid expansion of the transgene expressing
population. For animals, the desaturase transgene(s) can be adapted for
expression in target organelles, tissues and body fluids through modification of
the gene regulatory regions. Of particular interest is the production of PUFAs
in the breast milk of the host animal.

Expression In Yeast 

Examples of host microorganisms include Saccharomyces cerevisiae,
Saccharomyces carlsbergensis, or other yeast such as Candida, Kluyveromyces
or other fungi, for example, filamentous fungi such as Aspergillus, Neurospora,
Penicillium, etc. Desirable characteristics of a host microorganism are, for
example, that it is genetically well characterized, can be used for high level
expression of the product using ultra-high density fermentation, and is on the
GRAS (generally recognized as safe) list since the proposed end product is
intended for ingestion by humans. Of particular interest is use of a yeast, more
particularly baker's yeast (S. cerevisiae), as a cell host in the subject invention.
Strains of particular interest are SC334 (Mat a pep4-3 prb1-1122 ura3-52
leu2-3,112 reg1-501 gal1; Gene 83:57-64, 1989, Hovland P. et al.), YTC34 (a
ade2-101 his3Δ200 lys2-801 ura3-52; obtained from Dr. T. H. Chang, Ass. Professor
of Molecular Genetics, Ohio State University), YTC41 (a/a ura3-52/ura3-52
lys2-801/lys2-801 ade2-101/ade2-101 trp1-Δ1/trp1-Δ1 his3Δ200/his3Δ200
leu2Δ1/leu2Δ1; obtained from Dr. T. H. Chang, Ass. Professor of Molecular
Genetics, Ohio State University), BJ1995 (obtained from the Yeast Genetic
Stock Centre, 1021 Donner Laboratory, Berkeley, CA 94720), INVSC1 (Mat a
his3Δ1 leu2 trp1-289 ura3-52; obtained from Invitrogen, 1600 Faraday Ave.,
Carlsbad, CA 92008) and INVSC2 (Mat a his3Δ200 ura3-167; obtained from
Invitrogen).

Expression in Avian Species

For producing PUFAs in avian species and cells, such as chickens,
turkeys, quail and ducks, gene transfer can be performed by introducing a
nucleic acid sequence encoding a Δ6 and/or Δ12-desaturase into the cells
following procedures known in the art. If a transgenic animal is desired,
pluripotent stem cells of embryos can be provided with a vector carrying a
desaturase-encoding transgene and developed into an adult animal (USPN
5,162,215; Ono et al. (1996) Comparative Biochemistry and Physiology A
113(3):287-292; WO 9612793; WO 9606160). In most cases, the transgene
will be modified to express high levels of the desaturase in order to increase
production of PUFAs. The transgene can be modified, for example, by
providing transcriptional and/or translational regulatory regions that function in
avian cells, such as promoters which direct expression in particular tissues and
egg parts such as yolk. The gene regulatory regions can be obtained from a
variety of sources, including chicken anemia or avian leukosis viruses or avian
genes such as a chicken ovalbumin gene.

Expression in Insect Cells 

Production of PUFAs in insect cells can be conducted using baculovirus
expression vectors harboring one or more desaturase transgenes. Baculovirus
expression vectors are available from several commercial sources such as
Clontech. Methods for producing hybrid and transgenic strains of algae, such
as marine algae, which contain and express a desaturase transgene also are
provided. For example, transgenic marine algae may be prepared as described
in USPN 5,426,040. As with the other expression systems described above, the
timing, extent of expression and activity of the desaturase transgene can be
regulated by fitting the polypeptide coding sequence with the appropriate
transcriptional and translational regulatory regions selected for a particular use.
Of particular interest are promoter regions which can be induced under
preselected growth conditions. For example, introduction of temperature
sensitive and/or metabolite responsive mutations into the desaturase transgene
coding sequences, its regulatory regions, and/or the genome of cells into which
the transgene is introduced can be used for this purpose.

The transformed host cell is grown under appropriate conditions adapted
for a desired end result. For host cells grown in culture, the conditions are
typically optimized to produce the greatest or most economical yield of PUFAs,
which relates to the selected desaturase activity. Media conditions which may
be optimized include: carbon source, nitrogen source, addition of substrate,
final concentration of added substrate, form of substrate added, aerobic or
anaerobic growth, growth temperature, inducing agent, induction temperature,
growth phase at induction, growth phase at harvest, pH, density, and
maintenance of selection. Microorganisms of interest, such as yeast, are
preferably grown in selected medium. For yeast, a complex medium such as
peptone broth (YPD) or a defined medium such as a minimal medium (containing
amino acids, yeast nitrogen base, and ammonium sulfate, and lacking a
component for selection, for example uracil) is preferred. Desirably,
substrates to be added are first dissolved in ethanol. Where necessary,
expression of the polypeptide of interest may be induced, for example by
including or adding galactose to induce expression from a GAL promoter.

Expression In Plants 

Production of PUFAs in plants can be conducted using various plant
transformation systems such as the use of Agrobacterium tumefaciens, plant
viruses, particle cell transformation and the like which are disclosed in
Applicants' related applications U.S. Application Serial Nos. 08/834,033 and
08/956,985 and continuation-in-part applications filed simultaneously with this
application, all of which are hereby incorporated by reference.

Expression In An Animal

Expression in cells of a host animal can likewise be accomplished in a
transient or stable manner. Transient expression can be accomplished via known
methods, for example infection or lipofection, and can be repeated in order to
maintain desired expression levels of the introduced construct (see Ebert, PCT
publication WO 94/05782). Stable expression can be accomplished via
integration of a construct into the host genome, resulting in a transgenic animal.
The construct can be introduced, for example, by microinjection of the construct
into the pronuclei of a fertilized egg, or by transfection, retroviral infection or
other techniques whereby the construct is introduced into a cell line which may
form or be incorporated into an adult animal (U.S. Patent No. 4,873,191; U.S.
Patent No. 5,530,177; U.S. Patent No. 5,565,362; U.S. Patent No. 5,366,894;
Wilmut et al. (1997) Nature 385:810). The recombinant eggs or embryos are
transferred to a surrogate mother (U.S. Patent No. 4,873,191; U.S. Patent No.
5,530,177; U.S. Patent No. 5,565,362; U.S. Patent No. 5,366,894; Wilmut et al.
(supra)).

After birth, transgenic animals are identified, for example, by the
presence of an introduced marker gene, such as for coat color, or by PCR or
Southern blotting from a blood, milk or tissue sample to detect the introduced
construct, or by an immunological or enzymological assay to detect the
expressed protein or the products produced therefrom (U.S. Patent No.
4,873,191; U.S. Patent No. 5,530,177; U.S. Patent No. 5,565,362; U.S. Patent
No. 5,366,894; Wilmut et al. (supra)). The resulting transgenic animals may be
entirely transgenic or may be mosaics, having the transgenes in only a subset of
their cells. The advent of mammalian cloning, accomplished by fusing a
nucleated cell with an enucleated egg, followed by transfer into a surrogate
mother, presents the possibility of rapid, large-scale production upon obtaining
a "founder" animal or cell comprising the introduced construct; prior to this, it
was necessary for the transgene to be present in the germ line of the animal for
propagation (Wilmut et al. (supra)).

Expression in a host animal presents certain efficiencies, particularly
where the host is a domesticated animal. For production of PUFAs in a fluid
readily obtainable from the host animal, such as milk, the desaturase transgene
can be expressed in mammary cells from a female host, and the PUFA content
of the host cells altered. The desaturase transgene can be adapted for expression
so that it is retained in the mammary cells, or secreted into milk, to form the
PUFA reaction products localized to the milk (PCT publication WO 95/24488).
Expression can be targeted to mammary tissue using specific
regulatory sequences, such as those of bovine α-lactalbumin, α-casein,
β-casein, γ-casein, κ-casein, β-lactoglobulin, or whey acidic protein, and may
optionally include one or more introns and/or secretory signal sequences (U.S.
Patent No. 5,530,177; Rosen, U.S. Patent No. 5,565,362; Clark et al., U.S.
Patent No. 5,366,894; Garner et al., PCT publication WO 95/23868).
Expression of desaturase transgenes, or antisense desaturase transcripts, adapted
in this manner can be used to alter the levels of specific PUFAs, or derivatives
thereof, found in the animal's milk. Additionally, the desaturase transgene(s)
can be expressed either by itself or with other transgenes, in order to produce
animal milk containing higher proportions of desired PUFAs or PUFA ratios
and concentrations that resemble human breast milk (Prieto et al., PCT
publication WO 95/24494).

PURIFICATION OF FATTY ACIDS 

The desaturated fatty acids may be found in the host microorganism or
animal as free fatty acids or in conjugated forms such as acylglycerols,
phospholipids, sulfolipids or glycolipids, and may be extracted from the host
cell through a variety of means well-known in the art. Such means may include
extraction with organic solvents, sonication, supercritical fluid extraction using
for example carbon dioxide, and physical means such as presses, or
combinations thereof. Of particular interest is extraction with hexane or
methanol and chloroform. Where desirable, the aqueous layer can be acidified
to protonate negatively charged moieties and thereby increase partitioning of
desired products into the organic layer. After extraction, the organic solvents
can be removed by evaporation under a stream of nitrogen. When isolated in
conjugated forms, the products may be enzymatically or chemically cleaved to
release the free fatty acid or a less complex conjugate of interest, and can then
be subject to further manipulations to produce a desired end product. Desirably,
conjugated forms of fatty acids are cleaved with potassium hydroxide.

If further purification is necessary, standard methods can be employed.
Such methods may include extraction, treatment with urea, fractional
crystallization, HPLC, fractional distillation, silica gel chromatography, high
speed centrifugation or distillation, or combinations of these techniques.
Protection of reactive groups, such as the acid or alkenyl groups, may be done at
any step through known techniques, for example alkylation or iodination.
Methods used include methylation of the fatty acids to produce methyl esters.
Similarly, protecting groups may be removed at any step. Desirably,
purification of fractions containing GLA, SDA, ARA, DHA and EPA may be
accomplished by treatment with urea and/or fractional distillation.

USES OF FATTY ACIDS

The fatty acids of the subject invention find many applications. Probes
based on the DNAs of the present invention may find use in methods for
isolating related molecules or in methods to detect organisms expressing
desaturases. When used as probes, the DNAs or oligonucleotides must be
detectable. This is usually accomplished by attaching a label either at an
internal site, for example via incorporation of a modified residue, or at the 5' or
3' terminus. Such labels can be directly detectable, can bind to a secondary
molecule that is detectably labeled, or can bind to an unlabelled secondary
molecule and a detectably labeled tertiary molecule; this process can be
extended as long as is practical to achieve a satisfactorily detectable signal
without unacceptable levels of background signal. Secondary, tertiary, or
bridging systems can include use of antibodies directed against any other
molecule, including labels or other antibodies, or can involve any molecules
which bind to each other, for example a biotin-streptavidin/avidin system.
Detectable labels typically include radioactive isotopes, molecules which
chemically or enzymatically produce or alter light, enzymes which produce
detectable reaction products, magnetic molecules, fluorescent molecules or
molecules whose fluorescence or light-emitting characteristics change upon
binding. Examples of labelling methods can be found in USPN 5,011,770.
Alternatively, the binding of target molecules can be directly detected by
measuring the change in heat of solution on binding of probe to target via
isothermal titration calorimetry, or by coating the probe or target on a surface
and detecting the change in scattering of light from the surface produced by
binding of target or probe, respectively, as may be done with the BIAcore
system.

PUFAs produced by recombinant means find applications in a wide
variety of areas. Supplementation of animals or humans with PUFAs in various
forms can result in increased levels not only of the added PUFAs but of their
metabolic progeny as well.

NUTRITIONAL COMPOSITIONS 

The present invention also includes nutritional compositions. Such
compositions, for purposes of the present invention, include any food or
preparation for human consumption including for enteral or parenteral
consumption, which when taken into the body (a) serve to nourish or build up
tissues or supply energy and/or (b) maintain, restore or support adequate
nutritional status or metabolic function.

The nutritional composition of the present invention comprises at least
one oil or acid produced in accordance with the present invention and may
be in either a solid or a liquid form. Additionally, the composition may include
edible macronutrients, vitamins and minerals in amounts desired for a particular
use. The amount of such ingredients will vary depending on whether the
composition is intended for use with normal, healthy infants, children or adults,
or with individuals having specialized needs such as those which accompany
certain metabolic conditions (e.g., metabolic disorders).

Examples of macronutrients which may be added to the composition
include but are not limited to edible fats, carbohydrates and proteins. Examples
of such edible fats include but are not limited to coconut oil, soy oil, and mono-
and diglycerides. Examples of such carbohydrates include but are not limited to
glucose, edible lactose and hydrolyzed starch. Additionally, examples of
proteins which may be utilized in the nutritional composition of the invention
include but are not limited to soy proteins, electrodialysed whey,
electrodialysed skim milk, milk whey, or the hydrolysates of these proteins.

With respect to vitamins and minerals, the following may be added to 
the nutritional compositions of the present invention: calcium, phosphorus, 
potassium, sodium, chloride, magnesium, manganese, iron, copper, zinc, 
selenium, iodine, and Vitamins A, E, D, C, and the B complex. Other such 
vitamins and minerals may also be added.

The components utilized in the nutritional compositions of the present 
invention will be of semi-purified or purified origin. By semi-purified or purified
is meant a material which has been prepared by purification of a natural 
material or by synthesis. 

Examples of nutritional compositions of the present invention include
but are not limited to infant formulas, dietary supplements, and rehydration
compositions. Nutritional compositions of particular interest include but are not
limited to those utilized for enteral and parenteral supplementation for infants,
specialist infant formulae, supplements for the elderly, and supplements for
those with gastrointestinal difficulties and/or malabsorption.

Nutritional Compositions 

A typical nutritional composition of the present invention will contain
edible macronutrients, vitamins and minerals in amounts desired for a particular
use. The amounts of such ingredients will vary depending on whether the
formulation is intended for use with normal, healthy individuals temporarily
exposed to stress, or to subjects having specialized needs due to certain chronic
or acute disease states (e.g., metabolic disorders). It will be understood by
persons skilled in the art that the components utilized in a nutritional
formulation of the present invention are of semi-purified or purified origin. By
semi-purified or purified is meant a material that has been prepared by
purification of a natural material or by synthesis. These techniques are well
known in the art (See, e.g., Code of Federal Regulations for Food Ingredients
and Food Processing; Recommended Dietary Allowances, 10th Ed., National
Academy Press, Washington, D.C., 1989).

In a preferred embodiment, a nutritional formulation of the present
invention is an enteral nutritional product, more preferably an adult or child
enteral nutritional product. Accordingly in a further aspect of the invention, a
nutritional formulation is provided that is suitable for feeding adults or children
who are experiencing stress. The formula comprises, in addition to the PUFAs
of the invention, macronutrients, vitamins and minerals in amounts designed to
provide the daily nutritional requirements of adults.

The macronutritional components include edible fats, carbohydrates and
proteins. Exemplary edible fats are coconut oil, soy oil, and mono- and
diglycerides and the PUFA oils of this invention. Exemplary carbohydrates are
glucose, edible lactose and hydrolyzed cornstarch. A typical protein source
would be soy protein, electrodialysed whey or electrodialysed skim milk or milk
whey, or the hydrolysates of these proteins, although other protein sources are
also available and may be used. These macronutrients would be added in the
form of commonly accepted nutritional compounds in amounts equivalent to
those present in human milk on an energy basis, i.e., on a per calorie basis.

Methods for formulating liquid and enteral nutritional formulas are well 
known in the art and are described in detail in the examples. 

The enteral formula can be sterilized and subsequently utilized on a
ready-to-feed (RTF) basis or stored as a concentrated liquid or a powder. The
powder can be prepared by spray drying the enteral formula prepared as
indicated above, and the formula can be reconstituted by rehydrating the
concentrate. Adult and infant nutritional formulas are well known in the art and
commercially available (e.g., Similac®, Ensure®, Jevity® and Alimentum®
from Ross Products Division, Abbott Laboratories). An oil or acid of the
present invention can be added to any of these formulas in the amounts
described below.

The energy density of the nutritional composition, when in liquid form,
can typically range from about 0.6 to 3.0 Kcal per ml. When in solid or
powdered form, the nutritional supplement can contain from about 1.2 to more
than 9 Kcals per gm, preferably 3 to 7 Kcals per gm. In general, the osmolality
of a liquid product should be less than 700 mOsm and more preferably less than
660 mOsm.

The nutritional formula would typically include vitamins and minerals, 
in addition to the PUFAs of the invention, in order to help the individual ingest 
the minimum daily requirements for these substances. In addition to the PUFAs 
listed above, it may also be desirable to supplement the nutritional composition 
with zinc, copper, and folic acid in addition to antioxidants. It is believed that 
these substances will also provide a boost to the stressed immune system and 
thus will provide further benefits to the individual. The presence of zinc, 
copper or folic acid is optional and is not required in order to gain the beneficial 
effects on immune suppression. Likewise a pharmaceutical composition can be 
supplemented with these same substances as well. 

In a more preferred embodiment, the nutritional composition contains, in
addition to the antioxidant system and the PUFA component, a source of carbohydrate
wherein at least 5 weight % of said carbohydrate is an indigestible 
oligosaccharide. In yet a more preferred embodiment, the nutritional 
composition additionally contains protein, taurine and carnitine. 

The PUFAs, or derivatives thereof, made by the disclosed method can
be used as dietary substitutes, or supplements, particularly infant formulas, for
patients undergoing intravenous feeding or for preventing or treating
malnutrition. Typically, human breast milk has a fatty acid profile comprising
from about 0.15 % to about 0.36 % as DHA, from about 0.03 % to about 0.13 %
as EPA, from about 0.30 % to about 0.88 % as ARA, from about 0.22 % to
about 0.67 % as DGLA, and from about 0.27 % to about 1.04 % as GLA.
Additionally, the predominant triglyceride in human milk has been reported to
be 1,3-di-oleoyl-2-palmitoyl, with 2-palmitoyl glycerides reported as better
absorbed than 2-oleoyl or 2-linoleoyl glycerides (USPN 4,876,107). Thus, fatty
acids such as ARA, DGLA, GLA and/or EPA produced by the invention can be
used to alter the composition of infant formulas to better replicate the PUFA
composition of human breast milk. In particular, an oil composition for use in a
pharmacologic or food supplement, particularly a breast milk substitute or
supplement, will preferably comprise one or more of ARA, DGLA and GLA.
More preferably the oil will comprise from about 0.3 to 30% ARA, from about
0.2 to 30% DGLA, and from about 0.2 to about 30% GLA.
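
The breast milk profile quoted above lends itself to a simple range check when
comparing a candidate formula against it. The following Python sketch is
illustrative only: the function name out_of_range and the example formula
values are hypothetical and are not data from this specification; only the
ranges are those stated in the text.

```python
# Ranges (percent of the fatty acid profile) quoted in the text for human breast milk.
BREAST_MILK_RANGES = {
    "DHA": (0.15, 0.36),
    "EPA": (0.03, 0.13),
    "ARA": (0.30, 0.88),
    "DGLA": (0.22, 0.67),
    "GLA": (0.27, 1.04),
}


def out_of_range(profile):
    """Return {fatty_acid: (value, low, high)} for entries outside the quoted range.

    A fatty acid missing from `profile` is treated as 0.0 percent.
    """
    problems = {}
    for name, (low, high) in BREAST_MILK_RANGES.items():
        value = profile.get(name, 0.0)
        if not (low <= value <= high):
            problems[name] = (value, low, high)
    return problems


# Hypothetical formula profile, for illustration only (not from the source).
example_formula = {"DHA": 0.20, "EPA": 0.05, "ARA": 0.50, "DGLA": 0.10, "GLA": 0.30}

if __name__ == "__main__":
    for name, (value, low, high) in out_of_range(example_formula).items():
        print(f"{name}: {value} % is outside {low} to {high} %")
```

In this hypothetical example only DGLA falls outside the quoted range, which
would suggest supplementing the formula with a DGLA-containing oil.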

In addition to the concentration, the ratios of ARA, DGLA and GLA can
be adapted for a particular given end use. When formulated as a breast milk
supplement or substitute, an oil composition which contains two or more of
ARA, DGLA and GLA will be provided in a ratio of about 1:19:30 to about
6:1:0.2, respectively. For example, the breast milk of animals can vary in ratios
of ARA:DGLA:GLA ranging from 1:19:30 to 6:1:0.2, which includes
intermediate ratios which are preferably about 1:1:1, 1:2:1, 1:1:4. When
produced together in a host cell, adjusting the rate and percent of conversion of
a precursor substrate such as GLA and DGLA to ARA can be used to precisely
control the PUFA ratios. For example, a 5% to 10% conversion rate of DGLA
to ARA can be used to produce an ARA to DGLA ratio of about 1:19, whereas
a conversion rate of about 75% to 80% can be used to produce an ARA to
DGLA ratio of about 6:1. Therefore, whether in a cell culture system or in a
host animal, regulating the timing, extent and specificity of desaturase
expression as described can be used to modulate the PUFA levels and ratios.
Depending on the expression system used, e.g., cell culture or an animal
expressing oil(s) in its milk, the oils also can be isolated and recombined in the
desired concentrations and ratios. Amounts of oils providing these ratios of
PUFA can be determined following standard protocols. PUFAs, or host cells
containing them, also can be used as animal food supplements to alter an
animal's tissue or milk fatty acid composition to one more desirable for human
or animal consumption.
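
The relationship between conversion rate and product ratio can be illustrated
with a short calculation. The Python sketch below assumes a simplified closed
pool in which a fraction c of the DGLA is converted to ARA and nothing else is
consumed or produced, so that ARA:DGLA = c:(1 - c); this model and the
function names are assumptions made for illustration, not part of the
disclosed method.

```python
from fractions import Fraction


def ara_to_dgla_ratio(conversion):
    """ARA:DGLA ratio after a fraction `conversion` of a DGLA pool becomes ARA.

    Assumes a closed pool: ARA = c and remaining DGLA = 1 - c.
    """
    c = Fraction(str(conversion))  # str() keeps decimal inputs such as 0.05 exact
    if not 0 < c < 1:
        raise ValueError("conversion must lie strictly between 0 and 1")
    return c / (1 - c)


def conversion_for_ratio(ara, dgla):
    """Fraction of the DGLA pool that must be converted to reach ara:dgla."""
    return Fraction(ara, ara + dgla)


if __name__ == "__main__":
    for c in ("0.05", "0.10", "0.75", "0.80"):
        ratio = ara_to_dgla_ratio(c)
        print(f"conversion {c}: ARA:DGLA = {ratio.numerator}:{ratio.denominator}")
    print("conversion needed for 6:1 =", float(conversion_for_ratio(6, 1)))
```

Under this assumption a 5% conversion gives exactly 1:19, conversions of 75%
and 80% give 3:1 and 4:1, and a 6:1 ratio corresponds to a conversion of 6/7,
or roughly 86%. In a living host, other routes that consume or supply either
fatty acid would shift these figures.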

For dietary supplementation, the purified PUFAs, or derivatives thereof,
may be incorporated into cooking oils, fats or margarines formulated so that in
normal use the recipient would receive the desired amount. The PUFAs may
also be incorporated into infant formulas, nutritional supplements or other food
products, and may find use as anti-inflammatory or cholesterol lowering agents.

Pharmaceutical Compositions 

The present invention also encompasses a pharmaceutical composition
comprising one or more of the acids and/or resulting oils produced in
accordance with the methods described herein. More specifically, such a
pharmaceutical composition may comprise one or more of the acids and/or oils
as well as a standard, well-known, non-toxic pharmaceutically acceptable
carrier, adjuvant or vehicle such as, for example, phosphate buffered saline,
water, ethanol, polyols, vegetable oils, a wetting agent or an emulsion such as a
water/oil emulsion. The composition may be in either a liquid or solid form.
For example, the composition may be in the form of a tablet, capsule, ingestible
liquid or powder, injectable, or topical ointment or cream.

Possible routes of administration include, for example, oral, rectal and
parenteral. The route of administration will, of course, depend upon the desired
effect. For example, if the composition is being utilized to treat rough, dry, or
aging skin, to treat injured or burned skin, or to treat skin or hair affected by a
disease or condition, it may perhaps be applied topically.

The dosage of the composition to be administered to the patient may be
determined by one of ordinary skill in the art and depends upon various factors
such as weight of the patient, age of the patient, immune status of the patient,
etc.

With respect to form, the composition may be, for example, a solution, a
dispersion, a suspension, an emulsion or a sterile powder which is then
reconstituted.

Additionally, the composition of the present invention may be utilized
for cosmetic purposes. It may be added to pre-existing cosmetic compositions
such that a mixture is formed or may be used as a sole composition.

Pharmaceutical compositions may be utilized to administer the PUFA
component to an individual. Suitable pharmaceutical compositions may
comprise physiologically acceptable sterile aqueous or non-aqueous solutions,
dispersions, suspensions or emulsions and sterile powders for reconstitution into
sterile solutions or dispersions for ingestion. Examples of suitable aqueous and
non-aqueous carriers, diluents, solvents or vehicles include water, ethanol,
polyols (propyleneglycol, polyethyleneglycol, glycerol, and the like), suitable
mixtures thereof, vegetable oils (such as olive oil) and injectable organic esters
such as ethyl oleate. Proper fluidity can be maintained, for example, by the
maintenance of the required particle size in the case of dispersions and by the
use of surfactants. It may also be desirable to include isotonic agents, for
example sugars, sodium chloride and the like. Besides such inert diluents, the
composition can also include adjuvants, such as wetting agents, emulsifying and
suspending agents, sweetening, flavoring and perfuming agents.

Suspensions, in addition to the active compounds, may contain
suspending agents, as for example, ethoxylated isostearyl alcohols,
polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose,
aluminum metahydroxide, bentonite, agar-agar and tragacanth or mixtures of
these substances, and the like.

Solid dosage forms such as tablets and capsules can be prepared using
techniques well known in the art. For example, PUFAs of the invention can be
tableted with conventional tablet bases such as lactose, sucrose, and cornstarch
in combination with binders such as acacia, cornstarch or gelatin, disintegrating
agents such as potato starch or alginic acid and a lubricant such as stearic acid
or magnesium stearate. Capsules can be prepared by incorporating these
excipients into a gelatin capsule along with the antioxidants and the PUFA
component. The amount of the antioxidants and PUFA component that should
be incorporated into the pharmaceutical formulation should fit within the
guidelines discussed above.

As used in this application, the term "treat" refers to either preventing, or
reducing the incidence of, the undesired occurrence. For example, to treat
immune suppression refers to either preventing the occurrence of this
suppression or reducing the amount of such suppression. The terms "patient"
and "individual" are being used interchangeably and both refer to an animal.
The term "animal" as used in this application refers to any warm-blooded
mammal including, but not limited to, dogs, humans, monkeys, and apes. As
used in the application the term "about" refers to an amount varying from the
stated range or number by a reasonable amount depending upon the context of
use. Any numerical number or range specified in the specification should be
considered to be modified by the term about.

"Dose" and "serving" are used interchangeably and refer to the amount 
of the nutritional or pharmaceutical composition ingested by the patient in a
single sitting and designed to deliver effective amounts of the antioxidants and
the structured triglyceride. As will be readily apparent to those skilled in the
art, a single dose or serving of the liquid nutritional powder should supply the
amount of antioxidants and PUFAs discussed above. The amount of the dose or
serving should be a volume that a typical adult can consume in one sitting. This
amount can vary widely depending upon the age, weight, sex or medical
condition of the patient. However as a general guideline, a single serving or
dose of a liquid nutritional product should be considered as encompassing a
volume from 100 to 600 ml, more preferably from 125 to 500 ml and most
preferably from 125 to 300 ml.
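
Combining these serving volumes with the energy density of about 0.6 to 3.0
Kcal per ml given earlier for liquid compositions indicates how much energy a
single serving could deliver. The following Python sketch is a worked
arithmetic illustration only; the function and constant names are hypothetical,
and pairing the two ranges in this way is an assumption rather than a statement
from the specification.

```python
# Energy density range for liquid compositions, as stated earlier in the text.
LIQUID_ENERGY_DENSITY_KCAL_PER_ML = (0.6, 3.0)

# Serving volume ranges in ml, as stated in the paragraph above.
SERVING_VOLUME_ML = {
    "general guideline": (100, 600),
    "more preferred": (125, 500),
    "most preferred": (125, 300),
}


def energy_range_kcal(volume_range_ml, density_range=LIQUID_ENERGY_DENSITY_KCAL_PER_ML):
    """Return (minimum, maximum) Kcal per serving for the given ranges.

    The minimum pairs the smallest volume with the lowest density, and the
    maximum pairs the largest volume with the highest density.
    """
    low_volume, high_volume = volume_range_ml
    low_density, high_density = density_range
    return (low_volume * low_density, high_volume * high_density)


if __name__ == "__main__":
    for label, volumes in SERVING_VOLUME_ML.items():
        low, high = energy_range_kcal(volumes)
        print(f"{label}: about {low:.0f} to {high:.0f} Kcal per serving")
```

On these assumptions the general guideline spans roughly 60 to 1800 Kcal per
serving, which shows how widely a serving can vary before the product is
tailored to a particular patient.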

The PUFAs of the present invention may also be added to food even 
when supplementation of the diet is not required. For example, the composition 
may be added to food of any type including but not limited to margarines, 
modified butters, cheeses, milk, yogurt, chocolate, candy, snacks, salad oils, 
cooking oils, cooking fats, meats, fish and beverages.

Pharmaceutical Applications 

For pharmaceutical use (human or veterinary), the compositions are
generally administered orally but can be administered by any route by which
they may be successfully absorbed, e.g., parenterally (i.e. subcutaneously,
intramuscularly or intravenously), rectally or vaginally or topically, for
example, as a skin ointment or lotion. The PUFAs of the present invention may
be administered alone or in combination with a pharmaceutically acceptable
carrier or excipient. Where available, gelatin capsules are the preferred form of
oral administration. Dietary supplementation as set forth above also can
provide an oral route of administration. The unsaturated acids of the present
invention may be administered in conjugated forms, or as salts, esters, amides
or prodrugs of the fatty acids. Any pharmaceutically acceptable salt is
encompassed by the present invention; especially preferred are the sodium,
potassium or lithium salts. Also encompassed are the N-alkylpolyhydroxamine
salts, such as N-methyl glucamine, found in PCT publication WO 96/33155.
The preferred esters are the ethyl esters. As solid salts, the PUFAs also can be
administered in tablet form. For intravenous administration, the PUFAs or
derivatives thereof may be incorporated into commercial formulations such as
Intralipids. The typical normal adult plasma fatty acid profile comprises 6.64 to
9.46% of ARA, 1.45 to 3.11% of DGLA, and 0.02 to 0.08% of GLA. These
PUFAs or their metabolic precursors can be administered, either alone or in
mixtures with other PUFAs, to achieve a normal fatty acid profile in a patient.
Where desired, the individual components of formulations may be individually
provided in kit form, for single or multiple use. A typical dosage of a particular
fatty acid is from 0.1 mg to 20 g, or even 100 g daily, and is preferably from 10
mg to 1, 2, 5 or 10 g daily as required, or molar equivalent amounts of
derivative forms thereof. Parenteral nutrition compositions comprising from
about 2 to about 30 weight percent fatty acids calculated as triglycerides are
encompassed by the present invention; preferred is a composition having from
about 1 to about 25 weight percent of the total PUFA composition as GLA
(USPN 5,196,198). Other vitamins, and particularly fat-soluble vitamins such
as vitamins A, D and E, and L-carnitine can optionally be included. Where
desired, a preservative such as a tocopherol may be added, typically at about
0.1% by weight.
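
The phrase "molar equivalent amounts of derivative forms" can be made concrete
with a short calculation. The Python sketch below converts a dose of free GLA
into the mass of its ethyl ester or sodium salt that contains the same number
of moles. It assumes rounded standard atomic masses and the molecular formulas
C18H30O2 (GLA), C20H34O2 (ethyl ester) and C18H29NaO2 (sodium salt); the
function names and the 1000 mg example dose are illustrative and do not come
from the specification.

```python
# Rounded standard atomic masses in g/mol (assumed values for this sketch).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999, "Na": 22.990}

# Molecular formulas as element counts.
GLA_FREE_ACID = {"C": 18, "H": 30, "O": 2}
GLA_ETHYL_ESTER = {"C": 20, "H": 34, "O": 2}
GLA_SODIUM_SALT = {"C": 18, "H": 29, "Na": 1, "O": 2}


def molar_mass(formula):
    """Molar mass in g/mol for a formula given as {element: count}."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def molar_equivalent_mass(dose_mg, parent, derivative):
    """Mass of `derivative` (mg) containing the same moles as `dose_mg` of `parent`."""
    return dose_mg * molar_mass(derivative) / molar_mass(parent)


if __name__ == "__main__":
    dose_mg = 1000.0  # hypothetical daily dose of free GLA
    ester_mg = molar_equivalent_mass(dose_mg, GLA_FREE_ACID, GLA_ETHYL_ESTER)
    salt_mg = molar_equivalent_mass(dose_mg, GLA_FREE_ACID, GLA_SODIUM_SALT)
    print(f"{dose_mg:.0f} mg GLA = {ester_mg:.0f} mg ethyl ester = {salt_mg:.0f} mg sodium salt")
```

Because the ethyl ester and the sodium salt are both heavier than the free
acid, a molar equivalent dose of either derivative weighs about 8 to 10
percent more than the corresponding dose of free GLA.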

Suitable pharmaceutical compositions may comprise physiologically
acceptable sterile aqueous or non-aqueous solutions, dispersions, suspensions or
emulsions and sterile powders for reconstitution into sterile injectable solutions
or dispersions. Examples of suitable aqueous and non-aqueous carriers,
diluents, solvents or vehicles include water, ethanol, polyols (propyleneglycol,
polyethyleneglycol, glycerol, and the like), suitable mixtures thereof, vegetable
oils (such as olive oil) and injectable organic esters such as ethyl oleate. Proper
fluidity can be maintained, for example, by the maintenance of the required
particle size in the case of dispersions and by the use of surfactants. It may also
be desirable to include isotonic agents, for example sugars, sodium chloride and
the like. Besides such inert diluents, the composition can also include
adjuvants, such as wetting agents, emulsifying and suspending agents,
sweetening, flavoring and perfuming agents.

Suspensions, in addition to the active compounds, may contain
suspending agents, as for example, ethoxylated isostearyl alcohols,
polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose,
aluminum metahydroxide, bentonite, agar-agar and tragacanth, or mixtures of
these substances and the like.

An especially preferred pharmaceutical composition contains
diacetyltartaric acid esters of mono- and diglycerides dissolved in an aqueous
medium or solvent. Diacetyltartaric acid esters of mono- and diglycerides have
an HLB value of about 9-12 and are significantly more hydrophilic than existing
antimicrobial lipids that have HLB values of 2-4. Those existing hydrophobic
lipids cannot be formulated into aqueous compositions. As disclosed herein,
those lipids can now be solubilized into aqueous media in combination with
diacetyltartaric acid esters of mono- and diglycerides. In accordance with this
embodiment, diacetyltartaric acid esters of mono- and diglycerides (e.g.,
DATEM-C12:0) are melted with other active antimicrobial lipids (e.g., 18:2 and
12:0 monoglycerides) and mixed to obtain a homogeneous mixture.
Homogeneity allows for increased antimicrobial activity. The mixture can be
completely dispersed in water. This is not possible without the addition of
diacetyltartaric acid esters of mono- and diglycerides and premixing with other
monoglycerides prior to introduction into water. The aqueous composition can
then be admixed under sterile conditions with physiologically acceptable
diluents, preservatives, buffers or propellants as may be required to form a spray
or inhalant.

The present invention also encompasses the treatment of numerous
disorders with fatty acids. Supplementation with PUFAs of the present
invention can be used to treat restenosis after angioplasty. Symptoms of
inflammation, rheumatoid arthritis, asthma and psoriasis can be treated with
the PUFAs of the present invention. Evidence indicates that PUFAs may be
involved in calcium metabolism, suggesting that PUFAs of the present
invention may be used in the treatment or prevention of osteoporosis and of
kidney or urinary tract stones.

The PUFAs of the present invention can be used in the treatment of
cancer. Malignant cells have been shown to have altered fatty acid
compositions; addition of fatty acids has been shown to slow their growth and
cause cell death, and to increase their susceptibility to chemotherapeutic agents.
GLA has been shown to cause reexpression on cancer cells of the E-cadherin
cellular adhesion molecules, loss of which is associated with aggressive
metastasis. Clinical testing of intravenous administration of the water soluble
lithium salt of GLA to pancreatic cancer patients produced statistically
significant increases in their survival. PUFA supplementation may also be
useful for treating cachexia associated with cancer.

The PUFAs of the present invention can also be used to treat diabetes
(USPN 4,826,877; Horrobin et al., Am. J. Clin. Nutr. Vol. 57 (Suppl.), 732S-737S).
Altered fatty acid metabolism and composition have been demonstrated
in diabetic animals. These alterations have been suggested to be involved in
some of the long-term complications resulting from diabetes, including
retinopathy, neuropathy, nephropathy and reproductive system damage.
Primrose oil, which contains GLA, has been shown to prevent and reverse
diabetic nerve damage.

The PUFAs of the present invention can be used to treat eczema, reduce
blood pressure and improve math scores. Essential fatty acid deficiency has
been suggested as being involved in eczema, and studies have shown beneficial
effects on eczema from treatment with GLA. GLA has also been shown to
reduce increases in blood pressure associated with stress, and to improve
performance on arithmetic tests. GLA and DGLA have been shown to inhibit
platelet aggregation, cause vasodilation, lower cholesterol levels and inhibit
proliferation of vessel wall smooth muscle and fibrous tissue (Brenner et al.,
Adv. Exp. Med. Biol. Vol. 83, p. 85-101, 1976). Administration of GLA or
DGLA, alone or in combination with EPA, has been shown to reduce or prevent
gastro-intestinal bleeding and other side effects caused by non-steroidal
anti-inflammatory drugs (USPN 4,666,701). GLA and DGLA have also been shown
to prevent or treat endometriosis and premenstrual syndrome (USPN 4,758,592)
and to treat myalgic encephalomyelitis and chronic fatigue after viral infections
(USPN 5,116,871).

Further uses of the PUFAs of this invention include use in treatment of
AIDS, multiple sclerosis, acute respiratory syndrome, hypertension and
inflammatory skin disorders. The PUFAs of the invention also can be used in
formulas for general health as well as for geriatric treatments.

Veterinary Applications 

It should be noted that the above-described pharmaceutical and
nutritional compositions may be utilized in connection with animals, as well as
humans, as animals experience many of the same needs and conditions as
humans. For example, the oil or acids of the present invention may be utilized in
animal feed supplements.

The following examples are presented by way of illustration, not of
limitation.

Examples 

Example 1    Construction of a cDNA Library from Mortierella alpina

Example 2    Isolation of a Δ6-desaturase Nucleotide Sequence from
             Mortierella alpina

Example 3    Identification of Δ6-desaturases Homologous to the
             Mortierella alpina Δ6-desaturase

Example 4    Isolation of a Δ12-desaturase Nucleotide Sequence from
             Mortierella alpina

Example 5    Expression of M. alpina Desaturase Clones in Baker's
             Yeast

Example 6    Initial Optimization of Culture Conditions

Example 7    Distribution of PUFAs in Yeast Lipid Fractions

Example 8    Further Culture Optimization and Coexpression of Δ6
             and Δ12-desaturases

Example 9    Identification of Homologues to M. alpina Δ5 and Δ6
             desaturases

Example 10   Identification of M. alpina Δ5 and Δ6 homologues in
             other PUFA-producing organisms

Example 11   Identification of M. alpina Δ5 and Δ6 homologues in
             other PUFA-producing organisms

Example 12   Human Desaturase Gene Sequences

Example 13   Nutritional Compositions



Example 1

Construction of a cDNA Library from Mortierella alpina

Total RNA was isolated from a 3 day old PUFA-producing culture of
Mortierella alpina using the protocol of Hoge et al. (1982) Experimental
Mycology 6:225-232. The RNA was used to prepare double-stranded cDNA
using BRL's lambda-ZipLox system following the manufacturer's instructions.
Several size fractions of the M. alpina cDNA were packaged separately to yield
libraries with different average-sized inserts. The "full-length" library contains
approximately 3 x 10^6 clones with an average insert size of 1.77 kb. The
"sequencing-grade" library contains approximately 6 x 10^5 clones with an
average insert size of 1.1 kb.

Example 2

Isolation of a Δ6-desaturase Nucleotide Sequence from Mortierella alpina

A nucleic acid sequence from a partial cDNA clone, Ma524, encoding a
Δ6 fatty acid desaturase from Mortierella alpina was obtained by random
sequencing of clones from the M. alpina cDNA sequencing-grade library
described in Example 1. cDNA-containing plasmids were excised as follows:

Five µl of phage were combined with 100 µl of E. coli DH10B(ZIP)
grown in ECLB plus 10 µg/ml kanamycin, 0.2% maltose, and 10 mM MgSO4
and incubated at 37 degrees for 15 minutes. 0.9 ml SOC was added and 100 µl
of the bacteria immediately plated on each of 10 ECLB + 50 µg/ml Pen plates. No
45 minute recovery time was needed. The plates were incubated overnight at
37°. Colonies were picked into ECLB + 50 µg/ml Pen media for overnight cultures
to be used for making glycerol stocks and miniprep DNA. An aliquot of the
culture used for the miniprep was stored as a glycerol stock. Plating on ECLB +
50 µg Pen/ml resulted in more colonies and a greater proportion of colonies
containing inserts than plating on 100 µg/ml Pen.

Random colonies were picked and plasmid DNA purified using Qiagen
miniprep kits. DNA sequence was obtained from the 5' end of the cDNA insert
and compared to the National Center for Biotechnology Information (NCBI)
nonredundant database using the BLASTX algorithm. Ma524 was identified as
a putative desaturase based on DNA sequence homology to previously
identified desaturases.

A full-length cDNA clone was isolated from the M. alpina full-length
library and designated pCGN5532. The cDNA is contained as a 1617 bp insert in
the vector pZL1 (BRL) and, beginning with the first ATG, contains an open
reading frame encoding 457 amino acids. The three "histidine boxes" known to
be conserved among membrane-bound desaturases (Okuley, et
al. (1994) The Plant Cell 6:147-158) were found to be present at amino acid
positions 172-176, 209-213, and 395-399 (see Figure 3). As with other
membrane-bound Δ6-desaturases, the final HXXHH histidine box motif was
found to be QXXHH. The amino acid sequence of Ma524 was found to display
significant homology to a portion of a Caenorhabditis elegans cosmid,
W06D2.4, a cytochrome b5/desaturase fusion protein from sunflower, and the
Synechocystis and Spirulina Δ6-desaturases. In addition, Ma524 was shown to
have homology to the borage Δ6-desaturase amino acid sequence (PCT publication
WO 96/21022). Ma524 thus appears to encode a Δ6-desaturase that is related to
the borage and algal Δ6-desaturases. The peptide sequences are shown as SEQ
ID NO:5-SEQ ID NO:11.
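The histidine-box comparison described above amounts to locating short
five-residue motifs in a deduced protein sequence. The following Python sketch
is illustrative only: the function name find_histidine_boxes and the toy peptide
are assumptions introduced here, not the Ma524 sequence or a procedure
disclosed in this application, and only the two motif forms named in the text
(HXXHH and QXXHH) are searched.

```python
import re

# Five-residue motifs named in the text; "X" stands for any amino acid.
MOTIFS = {
    "HXXHH": r"H..HH",
    "QXXHH": r"Q..HH",
}

def find_histidine_boxes(protein):
    """Return (motif, start, end) tuples using 1-based inclusive positions."""
    hits = []
    for name, pattern in MOTIFS.items():
        # The lookahead reports overlapping occurrences as well.
        for match in re.finditer(r"(?=(%s))" % pattern, protein):
            start = match.start() + 1
            hits.append((name, start, start + 4))
    return sorted(hits, key=lambda hit: hit[1])

# Hypothetical toy peptide, NOT the Ma524 sequence.
toy = "MSHKAGHLAAAHRWHHAGGQIEHHGK"
for hit in find_histidine_boxes(toy):
    print(hit)
```

Positions are reported as 1-based inclusive ranges so that they can be read in
the same way as the ranges quoted in the text (for example, 395-399).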

The amino terminus of the encoded protein was found to exhibit
significant homology to cytochrome b5 proteins. The Mortierella cDNA clone
appears to represent a fusion between a cytochrome b5 and a fatty acid
desaturase. Since cytochrome b5 is believed to function as the electron donor
for membrane-bound desaturase enzymes, it is possible that the N-terminal
cytochrome b5 domain of this desaturase protein is involved in its function.
This may be advantageous when expressing the desaturase in heterologous
systems for PUFA production. However, it should be noted that, although the
amino acid sequences of Ma524 and the borage Δ6 were found to contain
regions of homology, the base compositions of the cDNAs were shown to be
significantly different. For example, the borage cDNA was shown to have an
overall base composition of 60% A/T, with some regions exceeding 70%,
while Ma524 was shown to have an average of 44% A/T base composition,
with no regions exceeding 60%. This may have implications for expressing the
cDNAs in microorganisms or animals which favor different base compositions.
It is known that poor expression of recombinant genes can occur when the host
prefers a base composition different from that of the introduced gene.
Mechanisms for such poor expression include decreased stability and/or
translatability of the mRNA, cryptic splice sites, and the like.
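The base-composition comparison above (overall A/T percent and the richest
local region) can be sketched as follows. This is a minimal illustration under
stated assumptions: the function names, the window length and the toy sequence
are hypothetical, and the window size used for the "regions" in the text is not
given in this application.

```python
def at_percent(seq):
    """Percent of A or T bases in a DNA sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "AT" for base in seq) / len(seq)

def max_window_at(seq, window):
    """Highest A/T percent found in any contiguous window of the given length."""
    if window <= 0 or window > len(seq):
        raise ValueError("window must be between 1 and len(seq)")
    return max(at_percent(seq[i:i + window])
               for i in range(len(seq) - window + 1))

# Hypothetical toy sequence, not a cDNA from this application.
toy_cdna = "ATATGCGC"
print(at_percent(toy_cdna))        # overall A/T percent
print(max_window_at(toy_cdna, 4))  # A/T percent of the richest 4-base window
```

Applied to real cDNA sequences, the first value corresponds to the overall
figures quoted above and the second to the "no regions exceeding" statements,
for whatever window length is chosen.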
Example 3

Identification of Δ6-desaturases Homologous to the
Mortierella alpina Δ6-desaturase

Nucleic acid sequences that encode putative Δ6-desaturases were
identified through a BLASTX search of the Expressed Sequence Tag ("EST")
databases through NCBI using the Ma524 amino acid sequence. Several
sequences showed significant homology. In particular, the deduced amino acid
sequences of two Arabidopsis thaliana sequences (accession numbers F13728
and T42806) showed homology to two different regions of the deduced amino
acid sequence of Ma524. The following PCR primers were designed:
ATTS4723-FOR (complementary to F13728), SEQ ID NO:13,
5' CUACUACUACUAGGAGTCCTCTACGGTGTTTTG, and
T42806-REV (complementary to T42806), SEQ ID NO:14,
5' CAUCAUCAUCAUATGATGCTCAAGCTGAAACTG. Five µg of total
RNA isolated from developing siliques of Arabidopsis thaliana was reverse
transcribed using BRL Superscript RTase and the primer TSyn
(5'-CCAAGCTTCTGCAGGAGCTCTTTTTTTTTTTTTTT-3'), which is shown as
SEQ ID NO:12. PCR was carried out in a 50 µl volume containing: template
derived from 25 ng total RNA, 2 pM each primer, 200 µM each
deoxyribonucleotide triphosphate, 60 mM Tris-Cl, pH 8.5, 15 mM (NH4)2SO4,
2 mM MgCl2, 0.2 U Taq Polymerase. Thermocycler conditions were as
follows: 94 degrees for 30 sec, 50 degrees for 30 sec, 72 degrees for 30 sec.
PCR was continued for 35 cycles followed by an additional extension at 72
degrees for 7 minutes. PCR resulted in a fragment of approximately 750 base
pairs which was subcloned, named 12-5, and sequenced. Each end of this
fragment was found to correspond to the Arabidopsis ESTs from which the
PCR primers were designed. The putative amino acid sequence of 12-5 was
compared to that of Ma524, and ESTs from human (W28140), mouse
(W53753), and C. elegans (R05219) (see Figure 4). Homology patterns with
the Mortierella Δ6-desaturase indicate that these sequences represent putative
desaturase polypeptides. Based on this experimental approach, it is likely that the
full-length genes can be cloned using probes based on the EST sequences.
Following the cloning, the genes can then be placed into expression vectors,
expressed in host cells, and their specific Δ6- or other desaturase activity can be
determined as described below.
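The thermocycler conditions given above can be written down as a small
program description, which also makes the programmed hold time easy to total.
The sketch below is an assumption-laden illustration: the Step class and the
function name are hypothetical, and ramp times between temperatures, which the
text does not state, are ignored.

```python
from dataclasses import dataclass

@dataclass
class Step:
    label: str
    temp_c: int    # block temperature in degrees Celsius
    seconds: int   # programmed hold time

# One cycle as stated in the text: 94 for 30 sec, 50 for 30 sec, 72 for 30 sec.
CYCLE = [
    Step("denature", 94, 30),
    Step("anneal", 50, 30),
    Step("extend", 72, 30),
]
CYCLES = 35
FINAL_EXTENSION = Step("final extension", 72, 7 * 60)

def hold_time_seconds(cycle, n_cycles, final_step):
    """Total programmed hold time, excluding ramping between temperatures."""
    return n_cycles * sum(step.seconds for step in cycle) + final_step.seconds

total = hold_time_seconds(CYCLE, CYCLES, FINAL_EXTENSION)
print(total, "seconds, or", total / 60, "minutes")
```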

Example 4 

Isolation of a Δ12-desaturase Nucleotide Sequence from Mortierella alpina

Based on the fatty acids it accumulates, it seemed probable that
Mortierella alpina has an ω6 type desaturase. The ω6-desaturase is responsible
for the production of linoleic acid (18:2) from oleic acid (18:1). Linoleic acid
(18:2) is a substrate for a Δ6-desaturase. This experiment was designed to
determine if Mortierella alpina has a Δ12-desaturase polypeptide, and if so, to
identify the corresponding nucleotide sequence.

A random colony from the M. alpina sequencing-grade library, Ma648,
was sequenced and identified as a putative desaturase based on DNA sequence
homology to previously identified desaturases, as described for Ma524 (see
Example 2). The nucleotide sequence is shown in SEQ ID NO: 13. The peptide
sequence is shown in SEQ ID NO:4. The deduced amino acid sequence from
the 5' end of the Ma648 cDNA displays significant homology to soybean
microsomal ω6 (Δ12) desaturase (accession #L43921) as well as castor bean
oleate 12-hydroxylase (accession #U22378). In addition, homology was
observed when compared to a variety of other ω6 (Δ12) and ω3 (Δ15) fatty acid
desaturase sequences.

Example 5

Expression of M. alpina Desaturase Clones in Baker's Yeast

Yeast Transformation 

Lithium acetate transformation of yeast was performed according to
standard protocols (Methods in Enzymology, Vol. 194, p. 186-187, 1991).
Briefly, yeast were grown in YPD at 30°C. Cells were spun down, resuspended
in TE, spun down again, resuspended in TE containing 100 mM lithium acetate,
spun down again, and resuspended in TE/lithium acetate. The resuspended
yeast were incubated at 30°C for 60 minutes with shaking. Carrier DNA was
added, and the yeast were aliquoted into tubes. Transforming DNA was added,
and the tubes were incubated for 30 min. at 30°C. PEG solution (35% (w/v)
PEG 4000, 100 mM lithium acetate, TE pH 7.5) was added followed by a 50
min. incubation at 30°C. A 5 min. heat shock at 42°C was performed, the cells
were pelleted, washed with TE, pelleted again and resuspended in TE. The
resuspended cells were then plated on selective media.

Desaturase Expression in Transformed Yeast 

cDNA clones from Mortierella alpina were screened for desaturase
activity in baker's yeast. A canola Δ15-desaturase (obtained by PCR using 1st
strand cDNA from Brassica napus cultivar 212/86 seeds using primers based on
the published sequence (Arondel et al., Science 258:1353-1355)) was used as a
positive control. The Δ15-desaturase gene and the genes from cDNA clones
Ma524 and Ma648 were put in the expression vector pYES2 (Invitrogen),
resulting in plasmids pCGR-2, pCGR-5 and pCGR-7, respectively. These
plasmids were transformed into S. cerevisiae yeast strain 334 and expressed after
induction with galactose and in the presence of substrates that allowed detection
of specific desaturase activity. The control strain was S. cerevisiae strain 334
containing the unaltered pYES2 vector. The substrates used, the products
produced and the indicated desaturase activity were: DGLA (conversion to
ARA would indicate Δ5-desaturase activity), linoleic acid (conversion to GLA
would indicate Δ6-desaturase activity; conversion to ALA would indicate
Δ15-desaturase activity), oleic acid (an endogenous substrate made by S. cerevisiae;
conversion to linoleic acid would indicate Δ12-desaturase activity, which
S. cerevisiae lacks), or ARA (conversion to EPA would indicate Δ17-desaturase
activity).

Cultures were grown for 48-52 hours at 15°C in the presence of a
particular substrate. Lipid fractions were extracted for analysis as follows:
Cells were pelleted by centrifugation, washed once with sterile ddH2O, and
repelleted. Pellets were vortexed with methanol; chloroform was added along
with tritridecanoin (as an internal standard). The mixtures were incubated for at
least one hour at room temperature or at 4°C overnight. The chloroform layer
was extracted and filtered through a Whatman filter with one gram of anhydrous
sodium sulfate to remove particulates and residual water. The organic solvents
were evaporated at 40°C under a stream of nitrogen. The extracted lipids were
then derivatized to fatty acid methyl esters (FAME) for gas chromatography
analysis (GC) by adding 2 ml of 0.5 N potassium hydroxide in methanol to a
closed tube. The samples were heated to 95°C to 100°C for 30 minutes and
cooled to room temperature. Approximately 2 ml of 14% boron trifluoride in
methanol was added and the heating repeated. After the extracted lipid mixture
cooled, 2 ml of water and 1 ml of hexane were added to extract the FAME for
analysis by GC. The percent conversion was calculated by dividing the product
produced by the sum of (the product produced and the substrate added) and then
multiplying by 100. To calculate the oleic acid percent conversion, as no
substrate was added, the total linoleic acid produced was divided by the sum of
oleic acid and linoleic acid produced, and then multiplied by 100. The desaturase
activity results are provided in Table 1 below.
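The percent conversion calculation described above can be written as a single
function. The sketch below is illustrative; the function name and the example
quantities are hypothetical and are not measurements from this application.

```python
def percent_conversion(product, substrate):
    """100 * product / (product + substrate), as described in the text.

    Both quantities must be in the same units (for example, percent of
    total lipid or micrograms) for the ratio to be meaningful.
    """
    total = product + substrate
    if total <= 0:
        raise ValueError("product + substrate must be positive")
    return 100.0 * product / total

# Exogenous substrate case, with hypothetical amounts of product and
# remaining substrate recovered from the extracted lipids.
print(percent_conversion(product=1.0, substrate=3.0))

# Endogenous oleic acid case: linoleic acid produced over the sum of
# oleic acid and linoleic acid (again hypothetical amounts).
print(percent_conversion(product=6.0, substrate=2.0))
```

The same expression covers both cases in the text; only the meaning of
"substrate" changes (added substrate recovered versus endogenous oleic acid).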
Table 1

M. alpina Desaturase Expression in Baker's Yeast

CLONE            ENZYME ACTIVITY    % CONVERSION OF SUBSTRATE

pCGR-2           Δ6                  0   (18:2 to 18:3ω6)
(canola Δ15      Δ15                16.3 (18:2 to 18:3ω3)
desaturase)      Δ5                  2.0 (20:3 to 20:4ω6)
                 Δ17                 2.8 (20:4 to 20:5ω3)
                 Δ12                 1.8 (18:1 to 18:2ω6)

pCGR-5           Δ6                  6.0
(M. alpina       Δ15                 0
Ma524)           Δ5                  2.1
                 Δ17                 0
                 Δ12                 3.3

pCGR-7           Δ6                  0
(M. alpina       Δ15                 3.8
Ma648)           Δ5                  2.2
                 Δ17                 0
                 Δ12                63.4

The Δ15-desaturase control clone exhibited 16.3% conversion of the
substrate. The pCGR-5 clone expressing the Ma524 cDNA showed 6%
conversion of the substrate to GLA, indicating that the gene encodes a
Δ6-desaturase. The pCGR-7 clone expressing the Ma648 cDNA showed 63.4%
conversion of the substrate to LA, indicating that the gene encodes a
Δ12-desaturase. The background (non-specific conversion of substrate) was between
0-3% in these cases. We also found substrate inhibition of the activity by using
different concentrations of the substrate. When substrate was added to 100 µM,
the percent conversion to product dropped compared to when substrate was added
to 25 µM (see below). Additionally, by varying the substrate concentration
between 5 µM and 200 µM, conversion ratios were found to range between about
5% to about 75% greater. These data show that desaturases with different
substrate specificities can be expressed in a heterologous system and used to
produce poly-unsaturated long chain fatty acids.

Table 2 represents fatty acids of interest as a percent of the total lipid
extracted from the yeast host S. cerevisiae 334 with the indicated plasmid. No
glucose was present in the growth media. Affinity gas chromatography was used
to separate the respective lipids. GC/MS was employed to verify the identity of
the product(s). The expected product for the B. napus Δ15-desaturase,
α-linolenic acid, was detected when its substrate, linoleic acid, was added
exogenously to the induced yeast culture. This finding demonstrates that yeast
expression of a desaturase gene can produce functional enzyme and detectable
amounts of product under the current growth conditions. Both exogenously
added substrates were taken up by yeast, although slightly less of the longer chain
PUFA, dihomo-γ-linolenic acid (20:3), was incorporated into yeast than linoleic
acid (18:2) when either was added in free form to the induced yeast cultures.
γ-Linolenic acid was detected when linoleic acid was present during induction and
expression of S. cerevisiae 334 (pCGR-5). The presence of this PUFA
demonstrates Δ6-desaturase activity from pCGR-5 (Ma524). Linoleic acid,
identified in the extracted lipids from expression of S. cerevisiae 334 (pCGR-7),
classifies the cDNA Ma648 from M. alpina as the Δ12-desaturase.

Example 6

Optimization of Culture Conditions

Table 3A shows the effect of exogenous free fatty acid substrate
concentration on yeast uptake and conversion to fatty acid product as a
percentage of the total yeast lipid extracted. In all instances, low amounts of
exogenous substrate (1-10 µM) resulted in low fatty acid substrate uptake and
product formation. Between 25 and 50 µM concentration of free fatty acid in
the growth and induction media gave the highest percentage of fatty acid
product formed, while the 100 µM concentration and subsequent high uptake
into yeast appeared to decrease or inhibit the desaturase activity. The amount of
fatty acid substrate for yeast expressing Δ12-desaturase was similar under the
same growth conditions, since the substrate, oleic acid, is an endogenous yeast
fatty acid. The use of α-linolenic acid as an additional substrate for pCGR-5
(Δ6) produced the expected product, stearidonic acid (Table 3A). The feedback
inhibition of high fatty acid substrate concentration was well illustrated when
the percent conversion rates of the respective fatty acid substrates to their
respective products were compared in Table 3B. In all cases, 100 µM substrate
concentration in the growth media decreased the percent conversion to product.
The uptake of α-linolenic acid was comparable to other PUFAs added in free form,
while the Δ6-desaturase percent conversion, 3.8-17.5%, to the product
stearidonic acid was the lowest of all the substrates examined (Table 3B). The
effect of media, such as YPD (rich media) versus minimal media with glucose,
on the conversion rate of Δ12-desaturase was dramatic. Not only did the
conversion rate for oleic to linoleic acid drop (Table 3B), but the percent of
linoleic acid formed also decreased by 11% when rich media was used for
growth and induction of yeast desaturase Δ12 expression (Table 3A). The
effect of media composition was also evident when glucose was present in the
growth media for Δ6-desaturase, since the percent of substrate uptake was
decreased at 25 µM (Table 3A). However, the conversion rate remained the
same and percent product formed decreased for Δ6-desaturase in the
presence of glucose.

Table 3A

Effect of Added Substrate on the Percentage of Incorporated
Substrate and Product Formed in Yeast Extracts

Plasmid in Yeast     pCGR-2        pCGR-5        pCGR-5        pCGR-7
                     (Δ15)         (Δ6)          (Δ6)          (Δ12)
Substrate/product    18:2/α-18:3   18:2/γ-18:3   α-18:3/18:4   18:1*/18:2

1 µM sub.            ND            0.9/0.7       ND            ND
10 µM sub.           ND            4.2/2.4       10.4/2.2      ND
25 µM sub.           ND            11/3.7        18.2/2.7      ND
25 µM◊ sub.          36.6/7.2◊     25.1/10.3◊    ND            6.6/15.8◊
50 µM sub.           53.1/6.5◊     ND            36.2/3        10.8/13+
100 µM sub.          60.1/5.7◊     62.4/4◊       47.7/1.9      10/24.8

Table 3B

Effect of Substrate Concentration in Media on the Percent Conversion
of Fatty Acid Substrate to Product in Yeast Extracts

Plasmid in Yeast     pCGR-2        pCGR-5        pCGR-5        pCGR-7
                     (Δ15)         (Δ6)          (Δ6)          (Δ12)
substrate→product    18:2→α-18:3   18:2→γ-18:3   α-18:3→18:4   18:1*→18:2

1 µM sub.            ND            43.8          ND            ND
10 µM sub.           ND            36.4          17.5          ND
25 µM sub.           ND            25.2          12.9          ND
25 µM◊ sub.          16.4◊         29.1◊         ND            70.5◊
50 µM sub.           10.9◊         ND            7.7           54.6+
100 µM sub.          8.7◊          6◊            3.8           71.3

◊ no glucose in media
+ Yeast peptone broth (YPD)
* 18:1 is an endogenous yeast lipid
sub. is substrate concentration
ND (not done)
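The percent conversions in Table 3B follow from the substrate/product pairs
in Table 3A by the calculation given in Example 5, and this can be checked
numerically. The sketch below is an illustration only: the row labels and
function names are assumptions, the values are transcribed from the two
tables with footnote marks dropped, and the entries "62.4/40" and "60" in
the extracted text are read here as 62.4/4 and 6 followed by a footnote mark,
which is itself an assumption.

```python
# (label, substrate %, product %, percent conversion reported in Table 3B)
ROWS = [
    ("pCGR-5 18:2, 1 uM",               0.9,  0.7, 43.8),
    ("pCGR-5 18:2, 10 uM",              4.2,  2.4, 36.4),
    ("pCGR-5 18:2, 25 uM",             11.0,  3.7, 25.2),
    ("pCGR-5 18:2, 25 uM no glucose",  25.1, 10.3, 29.1),
    ("pCGR-5 18:2, 100 uM",            62.4,  4.0,  6.0),  # assumed reading
    ("pCGR-5 a-18:3, 10 uM",           10.4,  2.2, 17.5),
    ("pCGR-5 a-18:3, 25 uM",           18.2,  2.7, 12.9),
    ("pCGR-5 a-18:3, 50 uM",           36.2,  3.0,  7.7),
    ("pCGR-5 a-18:3, 100 uM",          47.7,  1.9,  3.8),
    ("pCGR-2 18:2, 25 uM no glucose",  36.6,  7.2, 16.4),
    ("pCGR-2 18:2, 50 uM",             53.1,  6.5, 10.9),
    ("pCGR-2 18:2, 100 uM",            60.1,  5.7,  8.7),
    ("pCGR-7 18:1, 25 uM no glucose",   6.6, 15.8, 70.5),
    ("pCGR-7 18:1, 50 uM",             10.8, 13.0, 54.6),
    ("pCGR-7 18:1, 100 uM",            10.0, 24.8, 71.3),
]

def percent_conversion(substrate, product):
    """100 * product / (product + substrate)."""
    return 100.0 * product / (product + substrate)

def mismatched_rows(rows, tolerance=0.051):
    """Labels whose recomputed value differs from the reported one by more
    than one-decimal rounding allows."""
    return [label for label, sub, prod, reported in rows
            if abs(percent_conversion(sub, prod) - reported) >= tolerance]

mismatches = mismatched_rows(ROWS)
print(mismatches if mismatches else "Table 3B agrees with Table 3A")
```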



Table 4 shows the amount of fatty acid produced by a recombinant
desaturase from induced yeast cultures when different amounts of free fatty acid
substrate were used. Fatty acid weight was determined since the total amount of
lipid varied dramatically when the growth conditions were changed, such as the
presence of glucose in the yeast growth and induction media. To better
determine the conditions when the recombinant desaturase would produce the
most PUFA product, the quantity of individual fatty acids was examined. The
absence of glucose dramatically reduced by three-fold the amount of linoleic
acid produced by recombinant Δ12-desaturase. For the Δ12-desaturase the
amount of total yeast lipid was decreased by almost half in the absence of
glucose. Conversely, the presence of glucose in the yeast growth media for
Δ6-desaturase dropped the γ-linolenic acid produced by almost half, while the total
amount of yeast lipid produced was not changed by the presence/absence of
glucose. This points to a possible role for glucose as a modulator of
Δ6-desaturase activity.

Table 4

Fatty Acid Produced in µg from Yeast Extracts

Plasmid in Yeast     pCGR-5     pCGR-5     pCGR-7
(enzyme)             (Δ6)       (Δ6)       (Δ12)
product              γ-18:3     18:4       18:2*

1 µM sub.            1.9        ND         ND
10 µM sub.           5.3        4.4        ND
25 µM sub.           10.3       8.7        115.7
25 µM◊ sub.          29.6       ND         39◊

◊ no glucose in media
sub. is substrate concentration
ND (not done)
* 18:1, the substrate, is an endogenous yeast lipid

Example 7 

Distribution of PUFAs in Yeast Lipid Fractions 

Table 5 illustrates the uptake of free fatty acids and their new products
formed in yeast lipids as distributed in the major lipid fractions. A total lipid
extract was prepared as described above. The lipid extract was separated on
TLC plates, and the fractions were identified by comparison to standards. The
bands were collected by scraping, and internal standards were added. The
fractions were then saponified and methylated as above, and subjected to gas
chromatography. The gas chromatograph calculated the amount of fatty acid by
comparison to a standard. The phospholipid fraction contained the highest
amount of substrate and product PUFAs for Δ6-desaturase activity. It would
appear that the substrates are accessible in the phospholipid form to the
desaturases.

Table 5

Fatty Acid Distribution in Various Yeast Lipid Fractions in µg

Fatty acid        Phospholipid   Diglyceride   Free Fatty   Triglyceride   Cholesterol
fraction                                       Acid                        Ester

SC (pCGR-5)       166.6          6.2           15           18.2           15.6
substrate 18:2

SC (pCGR-5)       61.7           1.6           4.2          5.9            1.2
product γ-18:3

SC = S. cerevisiae (plasmid)
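The statement that the phospholipid fraction holds most of the substrate and
product can be made quantitative by expressing each Table 5 entry as a share
of its row total. The sketch below is illustrative; the variable and function
names are assumptions, and the values are transcribed from Table 5.

```python
FRACTIONS = ["phospholipid", "diglyceride", "free fatty acid",
             "triglyceride", "cholesterol ester"]

# Micrograms per lipid fraction for S. cerevisiae (pCGR-5), from Table 5.
SUBSTRATE_UG = [166.6, 6.2, 15.0, 18.2, 15.6]   # 18:2
PRODUCT_UG = [61.7, 1.6, 4.2, 5.9, 1.2]         # gamma-18:3

def shares(amounts):
    """Each amount as a percent of the row total."""
    total = sum(amounts)
    return [100.0 * amount / total for amount in amounts]

for name, sub, prod in zip(FRACTIONS, shares(SUBSTRATE_UG), shares(PRODUCT_UG)):
    print(f"{name:18s} substrate {sub:5.1f}%   product {prod:5.1f}%")
```

On these figures roughly three quarters of the recovered substrate and a
little over four fifths of the recovered product sit in the phospholipid
fraction.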



Example 8

Further Culture Optimization and Coexpression of Δ6 and Δ12-desaturases

This experiment was designed to evaluate the growth and induction
conditions for optimal activities of desaturases in Saccharomyces cerevisiae. A
Saccharomyces cerevisiae strain (SC334) capable of producing γ-linolenic acid
(GLA) was developed to assess the feasibility of production of PUFA in yeast.
The genes for Δ6 and Δ12-desaturases from M. alpina were coexpressed in
SC334. Expression of Δ12-desaturase converted oleic acid (present in yeast) to
linoleic acid. The linoleic acid was used as a substrate by the Δ6-desaturase to
produce GLA. The quantity of GLA produced ranged between 5-8% of the
total fatty acids produced in SC334 cultures and the conversion rate of linoleic
acid to γ-linolenic acid ranged between 30% and 50%. The induction temperature
was optimized, and the effect of changing host strain and upstream promoter
sequences on expression of Δ6 and Δ12 (Ma524 and Ma648, respectively)
desaturase genes was also determined.

Plasmid Construction

The cloning of pCGR5 as well as pCGR7 has been discussed above. To
construct pCGR9a and pCGR9b, the Δ6 and Δ12-desaturase genes were
amplified using the following sets of primers. The primers pRDS1 and 3 had
an XhoI site (CTC GAG) and primers pRDS2 and 4 had an XbaI site (TCT AGA).
These primer sequences are presented as SEQ ID NO:15-18.

I. Δ6-desaturase amplification primers

a. pRDS1 TAC CAA CTC GAG AAA ATG GCT GCT GCT CCC
AGT GTG AGG

b. pRDS2 AAC TGA TCT AGA TTA CTG CGC CTT ACC CAT
CTT GGA GGC

II. Δ12-desaturase amplification primers

a. pRDS3 TAC CAA CTC GAG AAA ATG GCA CCT CCC
AAC ACT ATC GAT

b. pRDS4 AAC TGA TCT AGA TTA CTT CTT GAA AAA GAC
CAC GTC TCC
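The placement of the restriction sites in the four primers listed above can be
confirmed with a short search. The sketch below is illustrative: the function
name is an assumption, the primer sequences are transcribed from the list
above, and the recognition sequences used are the standard ones for XhoI
(CTCGAG) and XbaI (TCTAGA).

```python
SITES = {"XhoI": "CTCGAG", "XbaI": "TCTAGA"}

PRIMERS = {
    "pRDS1": "TAC CAA CTC GAG AAA ATG GCT GCT GCT CCC AGT GTG AGG",
    "pRDS2": "AAC TGA TCT AGA TTA CTG CGC CTT ACC CAT CTT GGA GGC",
    "pRDS3": "TAC CAA CTC GAG AAA ATG GCA CCT CCC AAC ACT ATC GAT",
    "pRDS4": "AAC TGA TCT AGA TTA CTT CTT GAA AAA GAC CAC GTC TCC",
}

def sites_in(primer):
    """Map each enzyme whose site occurs in the primer to its 0-based offset."""
    seq = primer.replace(" ", "").upper()
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}

for name, primer in PRIMERS.items():
    print(name, sites_in(primer))
```

Each forward primer (pRDS1, pRDS3) carries only the XhoI site and each reverse
primer (pRDS2, pRDS4) only the XbaI site, in both cases six bases from the 5'
end, so a double digest leaves directional ends on the amplified fragment.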

The pCGR5 and pCGR7 constructs were used as template DNA for
amplification of Δ6 and Δ12-desaturase genes, respectively. The amplified
products were digested with XbaI and XhoI to create "sticky ends". The PCR
amplified Δ6-desaturase with XhoI-XbaI ends was cloned into pCGR7, which was
also cut with XhoI-XbaI. This procedure placed the Δ6-desaturase behind the
Δ12-desaturase, under the control of an inducible promoter GAL1. This
construct was designated pCGR9a. Similarly, to construct pCGR9b, the
Δ12-desaturase with XhoI-XbaI ends was cloned in the XhoI-XbaI sites of pCGR5.
In pCGR9b the Δ12-desaturase was behind the Δ6-desaturase gene, away from
the GAL promoter.

To construct pCGR10, the vector pRS425, which contains the constitutive
glyceraldehyde 3-phosphate dehydrogenase (GPD) promoter, was digested with
BamHI, and pCGR5 was digested with BamHI-XhoI to release the Δ6-desaturase
gene. This Δ6-desaturase fragment and the BamHI-cut pRS425 were filled in
using Klenow polymerase to create blunt ends and ligated, resulting in
pCGR10a and pCGR10b, which contain the Δ6-desaturase gene in the sense and
antisense orientation, respectively. To construct pCGR11 and pCGR12, the Δ6-
and Δ12-desaturase genes were isolated from pCGR5 and pCGR7, respectively,
using an EcoRI-XhoI double digest. The EcoRI-XhoI fragments of the Δ6- and
Δ12-desaturases were cloned into the pYX242 vector digested with EcoRI-XhoI.
The pYX242 vector has the promoter of TPI (a yeast housekeeping gene),
which allows constitutive expression.

Yeast Transformation and Expression

Different combinations of pCGR5, pCGR7, pCGR9a, pCGR9b, pCGR10a, pCGR11 and
pCGR12 were introduced into various host strains of Saccharomyces
cerevisiae. Transformation was done using the PEG/LiAc protocol (Methods in
Enzymology Vol. 194 (1991): 186-187). Transformants were selected by plating
on synthetic media lacking the appropriate amino acid. The pCGR5, pCGR7,
pCGR9a and pCGR9b constructs can be selected on media lacking uracil. The
pCGR10, pCGR11 and pCGR12 constructs can be selected on media lacking
leucine. Growth of cultures and fatty acid analysis were performed as in
Example 5 above.

Production of GLA

Production of GLA requires the expression of two enzymes (the Δ6- and
Δ12-desaturases), which are absent in yeast. To express these enzymes at
optimum levels, the following constructs or combinations of constructs were
introduced into various host strains:

1) pCGR9a/SC334

2) pCGR9b/SC334

3) pCGR10a and pCGR7/SC334

4) pCGR11 and pCGR7/SC334

5) pCGR12 and pCGR5/SC334

6) pCGR10a and pCGR7/DBY746

7) pCGR10a and pCGR7/DBY746

The pCGR9a construct has both the Δ6- and Δ12-desaturase genes under the
control of an inducible GAL promoter. The SC334 host cells transformed with
this construct did not show any GLA accumulation in total fatty acids (Fig.
6A and B, lane 1). However, when the Δ6- and Δ12-desaturase genes were
individually controlled by the GAL promoter, the control constructs were
able to express Δ6- and Δ12-desaturase, as evidenced by the conversion of
their respective substrates to products. The Δ12-desaturase gene in pCGR9a
was expressed, as evidenced by the conversion of 18:1ω9 to 18:2ω6 in
pCGR9a/SC334, while the Δ6-desaturase gene was not expressed or not active,
because the 18:2ω6 was not being converted to 18:3ω6 (Fig. 6A and B, lane 1).

The pCGR9b construct also had both the Δ6- and Δ12-desaturase genes under
the control of the GAL promoter, but in the inverse order compared to
pCGR9a. In this case, very little GLA (<1%) was seen in pCGR9b/SC334
cultures. The expression of Δ12-desaturase was also very low, as evidenced
by the low percentage of 18:2ω6 in the total fatty acids (Fig. 6A and B,
lane 1).

To test if expressing both enzymes under the control of independent
promoters would increase GLA production, the Δ6-desaturase gene was cloned
into the pRS425 vector. The pCGR10a construct has the Δ6-desaturase in the
correct orientation, under control of the constitutive GPD promoter. The
pCGR10b construct has the Δ6-desaturase gene in the inverse orientation and
serves as the negative control. The pCGR10a/SC334 cells produced
significantly higher levels of GLA (5% of the total fatty acids, Fig. 6,
lane 3) compared to pCGR9a. Both the Δ6- and Δ12-desaturase genes were
expressed at a high level, because the conversion of 18:1ω9 -> 18:2ω6 was
65%, while the conversion of 18:2ω6 -> 18:3ω6 (Δ6-desaturase) was 30% (Fig.
6, lane 3). As expected, the negative control pCGR10b/SC334 did not show any
GLA.

To further optimize GLA production, the Δ6 and Δ12 genes were introduced
into the pYX242 vector, creating pCGR11 and pCGR12, respectively. The pYX242
vector allows for constitutive expression by the TPI promoter (Alber, T. and
Kawasaki, G. (1982). J. Mol. & Appl. Genetics 1: 419). The introduction of
pCGR11 and pCGR7 in SC334 resulted in approximately 8% of GLA in total fatty
acids of SC334. The rates of conversion of 18:1ω9 -> 18:2ω6 and 18:2ω6 ->
18:3ω6 were approximately 50% and 44%, respectively (Fig. 6A and B, lane 4).
The presence of pCGR12 and pCGR5 in SC334 resulted in 6.6% GLA in total
fatty acids, with a conversion rate of approximately 50% for both 18:1ω9 to
18:2ω6 and 18:2ω6 to 18:3ω6 (Fig. 6A and B, lane 5). Thus, although the
quantity of GLA in total fatty acids was higher in the pCGR11/pCGR7
combination of constructs, the conversion rates of substrate to product were
better for the pCGR12/pCGR5 combination.
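The conversion rates quoted above can be related to the fatty acid percentages with a short calculation. The sketch below assumes that conversion is computed as product / (substrate + product) x 100, and that for the Δ12 step the downstream GLA is counted as product; this section does not restate the formula, so both points are assumptions. The function names and the numbers in the usage lines are hypothetical and are not data from the figures.

```python
def conversion_percent(substrate, products):
    """Percent conversion, assumed here to be products / (substrate + products)."""
    total = substrate + products
    if total == 0:
        return 0.0
    return 100.0 * products / total


def pathway_conversions(oleic, linoleic, gla):
    """Return (Δ12 conversion, Δ6 conversion) from fatty acid percentages.

    oleic    -- 18:1ω9 as percent of total fatty acids
    linoleic -- 18:2ω6 as percent of total fatty acids
    gla      -- 18:3ω6 as percent of total fatty acids
    """
    # Assumption: GLA made from linoleic acid still counts as Δ12 product.
    delta12 = conversion_percent(oleic, linoleic + gla)
    delta6 = conversion_percent(linoleic, gla)
    return delta12, delta6


if __name__ == "__main__":
    # Hypothetical composition, chosen only to illustrate the arithmetic.
    d12, d6 = pathway_conversions(oleic=20.0, linoleic=10.0, gla=10.0)
    print(f"18:1 -> 18:2 conversion: {d12:.1f}%")
    print(f"18:2 -> 18:3 conversion: {d6:.1f}%")
```

Under this definition a culture can show a higher GLA percentage yet a lower Δ6 conversion rate if more linoleic acid is left unconverted, which is the distinction drawn between the two construct combinations.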

To determine if changing host strain would increase GLA production, pCGR10a
and pCGR7 were introduced into the host strains BJ1995 and DBY746 (obtained
from the Yeast Genetic Stock Centre, 1021 Donner Laboratory, Berkeley, CA
94720. The genotype of strain DBY746 is Matα, his3-Δ1, leu2-3, leu2-112,
ura3-32, trp1-289, gal). The results are shown in Fig. 7. Changing host
strain to BJ1995 did not improve GLA production: the quantity of GLA was
only 1.31% of total fatty acids, and the conversion rate of 18:1ω9 -> 18:2ω6
was approximately 17% in BJ1995. No GLA was observed in DBY746, and the
conversion of 18:1ω9 -> 18:2ω6 was very low (<1% in control), suggesting
that a cofactor required for the expression of Δ12-desaturase might be
missing in DBY746 (Fig. 7, lane 2).

To determine the effect of temperature on GLA production, SC334 cultures
containing pCGR10a and pCGR7 were grown at 15°C and 30°C. Higher levels of
GLA were found in cultures grown and induced at 15°C than in cultures grown
at 30°C (4.23% vs. 1.68%). This was due to a lower conversion rate of 18:2ω6
-> 18:3ω6 in the 30°C cultures (11.6% vs. 29% at 15°C), despite a higher
conversion of 18:1ω9 -> 18:2ω6 (65% vs. 60% at 30°C) (Fig. 8). These results
suggest that Δ12- and Δ6-desaturases may have different optimal expression
temperatures.
Of the various parameters examined in this study, temperature of growth,
yeast host strain and media components had the most significant impact on
the expression of desaturase, while timing of substrate addition and
concentration of inducer did not significantly affect desaturase expression.

These data show that two DNAs encoding desaturases that can convert LA to
GLA or oleic acid to LA can be isolated from Mortierella alpina and can be
expressed, either individually or in combination, in a heterologous system
and used to produce poly-unsaturated long chain fatty acids. Exemplified is
the production of GLA from oleic acid by expression of Δ12- and
Δ6-desaturases in yeast.

Example 9

Identification of Homologues to M. alpina Δ5 and Δ6 desaturases

A nucleic acid sequence that encodes a putative Δ5 desaturase was identified
through a TBLASTN search of the expressed sequence tag databases through
NCBI using amino acids 100-446 of Ma29 as a query. The truncated portion of
the Ma29 sequence was used to avoid picking up homologies based on the
cytochrome b5 portion at the N-terminus of the desaturase. The deduced amino
acid sequence of an EST from Dictyostelium discoideum (accession # C25549)
shows very significant homology to Ma29 and lesser, but still significant,
homology to Ma524. The DNA sequence is presented as SEQ ID NO:19. The amino
acid sequence is presented as SEQ ID NO:20.

Example 10

Identification of M. alpina Δ5 and Δ6 homologues in other
PUFA-producing organisms

To look for desaturases involved in PUFA production, a cDNA library was
constructed from total RNA isolated from Phaeodactylum tricornutum. A
plasmid-based cDNA library was constructed in pSPORT1 (GIBCO-BRL) following
the manufacturer's instructions using a commercially available kit
(GIBCO-BRL). Random cDNA clones were sequenced, and nucleic acid sequences
that encode putative Δ5 or Δ6 desaturases were identified through a BLAST
search of the databases and comparison to the Ma29 and Ma524 sequences.

One clone was identified from the Phaeodactylum library with homology to
Ma29 and Ma524; it is called 144-011-B12. The DNA sequence is presented as
SEQ ID NO:21. The amino acid sequence is presented as SEQ ID NO:22.

Example 11

Identification of M. alpina Δ5 and Δ6 homologues in other
PUFA-producing organisms

To look for desaturases involved in PUFA production, a cDNA library was
constructed from total RNA isolated from Schizochytrium species. A
plasmid-based cDNA library was constructed in pSPORT1 (GIBCO-BRL) following
the manufacturer's instructions using a commercially available kit
(GIBCO-BRL). Random cDNA clones were sequenced, and nucleic acid sequences
that encode putative Δ5 or Δ6 desaturases were identified through a BLAST
search of the databases and comparison to the Ma29 and Ma524 sequences.

One clone was identified from the Schizochytrium library with homology to
Ma29 and Ma524; it is called 81-23-C7. This clone contains a ~1 kb insert.
Partial sequence was obtained from each end of the clone using the universal
forward and reverse sequencing primers. The DNA sequence from the forward
primer is presented as SEQ ID NO:23. The peptide sequence is presented as
SEQ ID NO:24. The DNA sequence from the reverse primer is presented as SEQ
ID NO:25. The amino acid sequence from the reverse primer is presented as
SEQ ID NO:26.
Example 12

Human Desaturase Gene Sequences

Human desaturase gene sequences potentially involved in long chain
polyunsaturated fatty acid biosynthesis were isolated based on homology
between the human cDNA sequences and Mortierella alpina desaturase gene
sequences. The three "histidine boxes" known to be conserved among
membrane-bound desaturases were found. As with some other membrane-bound
desaturases, the final HXXHH histidine box motif was found to be QXXHH. The
amino acid sequences of the putative human desaturases exhibited homology to
M. alpina Δ5, Δ6, Δ9, and Δ12 desaturases.
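The motif check described above can be sketched with a regular expression. This minimal Python example scans a protein sequence for the final histidine box in either its HXXHH or QXXHH form, where X is any residue. The function name find_third_box and the test sequence are made up for illustration; the sequence is not taken from any SEQ ID NO in this application.

```python
import re

# Final histidine box: H or Q, any two residues, then two histidines.
THIRD_BOX = re.compile(r"[HQ]..HH")


def find_third_box(protein):
    """Return (1-based position, matched motif) for each non-overlapping hit."""
    return [
        (match.start() + 1, match.group())
        for match in THIRD_BOX.finditer(protein.upper())
    ]


if __name__ == "__main__":
    # Invented sequence carrying one HXXHH and one QXXHH motif.
    print(find_third_box("AAHDAHHLLQIEHHKK"))
```

A pattern this short will also match by chance in unrelated proteins, so a hit is only suggestive unless the other two histidine boxes are present in the expected order.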

The M. alpina Δ5 desaturase and Δ6 desaturase cDNA sequences were used to
search the LifeSeq database of Incyte Pharmaceuticals, Inc., Palo Alto,
California 94304. The Δ5 desaturase sequence was divided into three
fragments: 1) amino acid no. 1-150, 2) amino acid no. 151-300, and 3) amino
acid no. 301-446. The Δ6 desaturase sequence was divided into three
fragments: 1) amino acid no. 1-150, 2) amino acid no. 151-300, and 3) amino
acid no. 301-457. These polypeptide fragments were searched against the
database using the "tblastn" algorithm. This algorithm compares a protein
query sequence against a nucleotide sequence database dynamically translated
in all six reading frames (both strands).
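The fragment boundaries listed above follow a simple rule: fixed 150-residue windows, with the last window extended to the end of the protein. A minimal Python sketch of that rule is given below; the function name split_query and its parameters are illustrative and are not part of the LifeSeq or tblastn tooling.

```python
def split_query(length, size=150, pieces=3):
    """Return 1-based inclusive (start, end) ranges for the query fragments.

    The last fragment absorbs whatever remains beyond pieces * size.
    """
    ranges = []
    for index in range(pieces):
        start = index * size + 1
        end = length if index == pieces - 1 else (index + 1) * size
        ranges.append((start, end))
    return ranges


if __name__ == "__main__":
    print("Δ5 (446 aa):", split_query(446))
    print("Δ6 (457 aa):", split_query(457))
```

Each fragment is then submitted as a separate protein query.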

The polypeptide fragments 2 and 3 of M. alpina Δ5 and Δ6 have homologies
with the CloneID sequences as outlined in Table 6. The CloneID represents an
individual sequence from the Incyte LifeSeq database. After the "tblastn"
results were reviewed, Clone Information was searched with the default
settings of Stringency of >=50 and Productscore <=100 for different CloneID
numbers. The Clone Information Results displayed information including the
ClusterID, CloneID, Library, HitID, and Hit Description. When selected, the
ClusterID number displayed the clone information of all the clones that
belong in that ClusterID. The Assemble command assembles all of the CloneIDs
which comprise the ClusterID. The following default settings were used for
GCG (Genetics Computer Group, University of Wisconsin Biotechnology Center,
Madison, Wisconsin 53705) Assembly:

Word Size: 7

Minimum Overlap: 14

Stringency: 0.8

Minimum Identity: 14

Maximum Gap: 10

Gap Weight: 8

Length Weight: 2

GCG Assembly Results displayed the contigs generated on the basis of
sequence information within the CloneID. A contig is an alignment of DNA
sequences based on areas of homology among these sequences. A new sequence
(consensus sequence) was generated based on the aligned DNA sequences within
a contig. The contig containing the CloneID was identified, and the
ambiguous sites of the consensus sequence were edited based on the alignment
of the CloneIDs (see SEQ ID NO:27 - SEQ ID NO:32) to generate the best
possible sequence. The procedure was repeated for all six CloneIDs listed in
Table 6. This produced five unique contigs. The edited consensus sequences
of the 5 contigs were imported into the Sequencher software program (Gene
Codes Corporation, Ann Arbor, Michigan 48105). These consensus sequences
were assembled. The contig 2511785 overlaps with contig 3506132, and this
new contig was called 2535 (SEQ ID NO:33). The contigs from the Sequencher
program were copied into the Sequence Analysis software package of GCG.

Each contig was translated in all six reading frames into protein sequences.
The M. alpina Δ5 (MA29) and Δ6 (MA524) sequences were compared with each of
the translated contigs using the FastA search (a Pearson and Lipman search
for similarity between a query sequence and a group of sequences of the same
type (nucleic acid or protein)). Homology among these sequences suggests the
open reading frames of each contig. The homology of the M. alpina Δ5 and Δ6
sequences to contigs 2535 and 3854933 was utilized to create the final
contig called 253538a. Figure 13 is the FastA match of the final contig
253538a and MA29, and Figure 14 is the FastA match of the final contig
253538a and MA524. The DNA sequences for the various contigs are presented
in SEQ ID NO:27 - SEQ ID NO:33. The various peptide sequences are shown in
SEQ ID NO:34 - SEQ ID NO:40.
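The six-frame translation step described above can be illustrated with a short, self-contained Python sketch. It uses the standard genetic code and is not the GCG implementation; the function names and the frame labels (+1 to +3, -1 to -3) are conventions chosen here for illustration. The test sequence is the first five codons of the Δ6 coding region as they appear in primer pRDS1.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons; unknown codons (e.g. with N) become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translations(dna):
    """Return translations for frames +1..+3 and -1..-3, keyed by frame label."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


if __name__ == "__main__":
    for label, protein in six_frame_translations("ATGGCTGCTGCTCCC").items():
        print(label, protein)
```

Comparing each of the six translations against a known desaturase, as was done here with FastA, indicates which frame carries the open reading frame.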

Although the open reading frame was generated by merging the two contigs,
the contig 2535 shows that there is a unique sequence at the beginning of
this contig which does not match the contig 3854933. Therefore, it is
possible that these contigs were generated from independent desaturase-like
human genes.

The contig 253538a contains an open reading frame encoding 432 amino acids.
It starts with Gln (CAG) and ends with the stop codon (TGA). The contig
253538a aligns with both the M. alpina Δ5 and Δ6 sequences, suggesting that
it could be either of the desaturases, as well as with other known
desaturases which share homology with each other. The individual contigs
listed in Table 18, as well as the intermediate contig 2535 and the final
contig 253538a, can be utilized to isolate the complete genes for human
desaturases.

Uses of the human desaturases

These human sequences can be expressed in yeast and plants utilizing the
procedures described in the preceding examples. For expression in mammalian
cells and transgenic animals, these genes may provide superior codon bias.

In addition, these sequences can be used to isolate related desaturase
genes from other organisms.



Table 6

Sections of the     Clone ID from         Keyword
Desaturases         LifeSeq Database

151-300 Δ5          3808675               fatty acid desaturase
301-446 Δ5          354535                Δ6
151-300 Δ6          3448789               Δ6
151-300 Δ6          1362863               Δ6
151-300 Δ6          2394760               Δ6
301-457 Δ6          3350263               Δ6



Example 13

I. INFANT FORMULATIONS

A. Isomil® Soy Formula with Iron.

Usage: As a beverage for infants, children and adults with an allergy or
sensitivity to cow's milk. A feeding for patients with disorders for which
lactose should be avoided: lactase deficiency, lactose intolerance and
galactosemia.

Features:

• Soy protein isolate to avoid symptoms of cow's-milk-protein allergy or
sensitivity.

• Lactose-free formulation to avoid lactose-associated diarrhea.

• Low osmolality (240 mOsm/kg water) to reduce risk of osmotic diarrhea.

• Dual carbohydrates (corn syrup and sucrose) designed to enhance
carbohydrate absorption and reduce the risk of exceeding the absorptive
capacity of the damaged gut.

• 1.8 mg of iron (as ferrous sulfate) per 100 Calories to help prevent iron
deficiency.

• Recommended levels of vitamins and minerals.

• Vegetable oils to provide recommended levels of essential fatty acids.

• Milk-white color, milk-like consistency and pleasant aroma.
Ingredients: (Pareve, ©) 85% water, 4.9% corn syrup, 2.6% sugar (sucrose),
2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% calcium
citrate, 0.11% calcium phosphate tribasic, potassium citrate, potassium
phosphate monobasic, potassium chloride, mono- and diglycerides, soy
lecithin, carrageenan, ascorbic acid, L-methionine, magnesium chloride,
potassium phosphate dibasic, sodium chloride, choline chloride, taurine,
ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate,
L-carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin A
palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine
hydrochloride, folic acid, manganese sulfate, potassium iodide,
phylloquinone, biotin, sodium selenite, vitamin D3 and cyanocobalamin.

B. Isomil® DF Soy Formula For Diarrhea. 

Usage: As a short-term feeding for the dietary management of diarrhea 
in infants and toddlers. 

Features:

• First infant formula to contain added dietary fiber from soy fiber
specifically for diarrhea management.

• Clinically shown to reduce the duration of loose, watery stools during
mild to severe diarrhea in infants.

• Nutritionally complete to meet the nutritional needs of the infant.

• Soy protein isolate with added L-methionine meets or exceeds an infant's
requirement for all essential amino acids.

• Lactose-free formulation to avoid lactose-associated diarrhea.

• Low osmolality (240 mOsm/kg water) to reduce the risk of osmotic diarrhea.

• Dual carbohydrates (corn syrup and sucrose) designed to enhance
carbohydrate absorption and reduce the risk of exceeding the absorptive
capacity of the damaged gut.

• Meets or exceeds the vitamin and mineral levels recommended by the
Committee on Nutrition of the American Academy of Pediatrics and required by
the Infant Formula Act.

• 1.8 mg of iron (as ferrous sulfate) per 100 Calories to help prevent iron
deficiency.

• Vegetable oils to provide recommended levels of essential fatty acids.

Ingredients: (Pareve, ®) 86% water, 4.8% corn syrup, 2.5% sugar (sucrose),
2.1% soy oil, 2.0% soy protein isolate, 1.4% coconut oil, 0.77% soy fiber,
0.12% calcium citrate, 0.11% calcium phosphate tribasic, 0.10% potassium
citrate, potassium chloride, potassium phosphate monobasic, mono- and
diglycerides, soy lecithin, carrageenan, magnesium chloride, ascorbic acid,
L-methionine, potassium phosphate dibasic, sodium chloride, choline
chloride, taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate,
zinc sulfate, L-carnitine, niacinamide, calcium pantothenate, cupric
sulfate, vitamin A palmitate, thiamine chloride hydrochloride, riboflavin,
pyridoxine hydrochloride, folic acid, manganese sulfate, potassium iodide,
phylloquinone, biotin, sodium selenite, vitamin D3 and cyanocobalamin.

C. Isomil® SF Sucrose-Free Soy Formula With Iron. 

Usage: As a beverage for infants, children and adults with an allergy or
sensitivity to cow's-milk protein or an intolerance to sucrose. A feeding
for patients with disorders for which lactose and sucrose should be avoided.

Features:

• Soy protein isolate to avoid symptoms of cow's-milk-protein allergy or
sensitivity.

• Lactose-free formulation to avoid lactose-associated diarrhea
(carbohydrate source is Polycose® Glucose Polymers).

• Sucrose free for the patient who cannot tolerate sucrose.

• Low osmolality (180 mOsm/kg water) to reduce risk of osmotic diarrhea.

• 1.8 mg of iron (as ferrous sulfate) per 100 Calories to help prevent iron
deficiency.

• Recommended levels of vitamins and minerals.

• Vegetable oils to provide recommended levels of essential fatty acids.

• Milk-white color, milk-like consistency and pleasant aroma.

Ingredients: (Pareve, ®) 75% water, 11.8% hydrolyzed cornstarch, 4.1% soy
oil, 4.1% soy protein isolate, 2.8% coconut oil, 1.0% modified cornstarch,
0.38% calcium phosphate tribasic, 0.17% potassium citrate, 0.13% potassium
chloride, mono- and diglycerides, soy lecithin, magnesium chloride, ascorbic
acid, L-methionine, calcium carbonate, sodium chloride, choline chloride,
carrageenan, taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate,
zinc sulfate, L-carnitine, niacinamide, calcium pantothenate, cupric
sulfate, vitamin A palmitate, thiamine chloride hydrochloride, riboflavin,
pyridoxine hydrochloride, folic acid, manganese sulfate, potassium iodide,
phylloquinone, biotin, sodium selenite, vitamin D3 and cyanocobalamin.

D. Isomil® 20 Soy Formula With Iron Ready To Feed, 20 Cal/fl oz.

Usage: When a soy feeding is desired.

Ingredients: (Pareve, ®) 85% water, 4.9% corn syrup, 2.6% sugar (sucrose),
2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% calcium
citrate, 0.11% calcium phosphate tribasic, potassium citrate, potassium
phosphate monobasic, potassium chloride, mono- and diglycerides, soy
lecithin, carrageenan, ascorbic acid, L-methionine, magnesium chloride,
potassium phosphate dibasic, sodium chloride, choline chloride, taurine,
ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate,
L-carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin A
palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine
hydrochloride, folic acid, manganese sulfate, potassium iodide,
phylloquinone, biotin, sodium selenite, vitamin D3 and cyanocobalamin.

E. Similac® Infant Formula 

Usage: When an infant formula is needed: if the decision is made to
discontinue breastfeeding before age 1 year, if a supplement to
breastfeeding is needed or as a routine feeding if breastfeeding is not
adopted.

Features:

• Protein of appropriate quality and quantity for good growth;
heat-denatured, which reduces the risk of milk-associated enteric blood
loss.

• Fat from a blend of vegetable oils (doubly homogenized), providing
essential linoleic acid that is easily absorbed.

• Carbohydrate as lactose in proportion similar to that of human milk.

• Low renal solute load to minimize stress on developing organs.

• Powder, Concentrated Liquid and Ready To Feed forms.

Ingredients: (®-D) Water, nonfat milk, lactose, soy oil, coconut oil, mono-
and diglycerides, soy lecithin, ascorbic acid, carrageenan, choline
chloride, taurine, m-inositol, alpha-tocopheryl acetate, zinc sulfate,
niacinamide, ferrous sulfate, calcium pantothenate, cupric sulfate, vitamin
A palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine
hydrochloride, folic acid, manganese sulfate, phylloquinone, biotin, sodium
selenite, vitamin D3 and cyanocobalamin.

F. Similac® NeoCare Premature Infant Formula With Iron 

Usage: For premature infants' special nutritional needs after hospital
discharge. Similac NeoCare is a nutritionally complete formula developed to
provide premature infants with extra calories, protein, vitamins and
minerals needed to promote catch-up growth and support development.

Features:

• Reduces the need for caloric and vitamin supplementation. More calories
(22 Cal/fl oz) than standard term formulas (20 Cal/fl oz).

• Highly absorbed fat blend, with medium-chain triglycerides (MCT oil) to
help meet the special digestive needs of premature infants.

• Higher levels of protein, vitamins and minerals per 100 Calories to extend
the nutritional support initiated in-hospital.

• More calcium and phosphorus for improved bone mineralization.

Ingredients: ®-D Corn syrup solids, nonfat milk, lactose, whey protein
concentrate, soy oil, high-oleic safflower oil, fractionated coconut oil
(medium-chain triglycerides), coconut oil, potassium citrate, calcium
phosphate tribasic, calcium carbonate, ascorbic acid, magnesium chloride,
potassium chloride, sodium chloride, taurine, ferrous sulfate, m-inositol,
choline chloride, ascorbyl palmitate, L-carnitine, alpha-tocopheryl acetate,
zinc sulfate, niacinamide, mixed tocopherols, sodium citrate, calcium
pantothenate, cupric sulfate, thiamine chloride hydrochloride, vitamin A
palmitate, beta carotene, riboflavin, pyridoxine hydrochloride, folic acid,
manganese sulfate, phylloquinone, biotin, sodium selenite, vitamin D3 and
cyanocobalamin.

G. Similac Natural Care Low-Iron Human Milk Fortifier Ready 
To Use, 24 Cal/fl oz. 

Usage: Designed to be mixed with human milk or to be fed alternatively with
human milk to low-birth-weight infants.

Ingredients: ©-D Water, nonfat milk, hydrolyzed cornstarch, lactose,
fractionated coconut oil (medium-chain triglycerides), whey protein
concentrate, soy oil, coconut oil, calcium phosphate tribasic, potassium
citrate, magnesium chloride, sodium citrate, ascorbic acid, calcium
carbonate, mono- and diglycerides, soy lecithin, carrageenan, choline
chloride, m-inositol, taurine, niacinamide, L-carnitine, alpha-tocopheryl
acetate, zinc sulfate, potassium chloride, calcium pantothenate, ferrous
sulfate, cupric sulfate, riboflavin, vitamin A palmitate, thiamine chloride
hydrochloride, pyridoxine hydrochloride, biotin, folic acid, manganese
sulfate, phylloquinone, vitamin D3, sodium selenite and cyanocobalamin.

Various PUFAs of this invention can be substituted and/or added to the
infant formulae described above and to other infant formulae known to those
in the art.

II. NUTRITIONAL FORMULATIONS 

A. ENSURE® 

Usage: ENSURE is a low-residue liquid food designed primarily as an oral
nutritional supplement to be used with or between meals or, in appropriate
amounts, as a meal replacement. ENSURE is lactose- and gluten-free, and is
suitable for use in modified diets, including low-cholesterol diets.
Although it is primarily an oral supplement, it can be fed by tube.

Patient Conditions:

• For patients on modified diets

• For elderly patients at nutrition risk

• For patients with involuntary weight loss

• For patients recovering from illness or surgery

• For patients who need a low-residue diet

Ingredients:

®-D Water, Sugar (Sucrose), Maltodextrin (Corn), Calcium and Sodium
Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, Canola
Oil, Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate,
Magnesium Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium
Chloride, Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc
Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide,
Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride,
Riboflavin, Folic Acid, Sodium Molybdate, Chromium Chloride, Biotin,
Potassium Iodide, Sodium Selenate.
B. ENSURE® BARS

Usage: ENSURE BARS are complete, balanced nutrition for supplemental use
between or with meals. They provide a delicious, nutrient-rich alternative
to other snacks. ENSURE BARS contain <1 g lactose/bar, and Chocolate Fudge
Brownie flavor is gluten-free. (Honey Graham Crunch flavor contains gluten.)

Patient Conditions:

• For patients who need extra calories, protein, vitamins and minerals

• Especially useful for people who do not take in enough calories and
nutrients

• For people who have the ability to chew and swallow

• Not to be used by anyone with a peanut allergy or any type of allergy to
nuts.

Ingredients:

Honey Graham Crunch - High-Fructose Corn Syrup, Soy Protein Isolate, Brown
Sugar, Honey, Maltodextrin (Corn), Crisp Rice (Milled Rice, Sugar [Sucrose],
Salt [Sodium Chloride] and Malt), Oat Bran, Partially Hydrogenated
Cottonseed and Soy Oils, Soy Polysaccharide, Glycerine, Whey Protein
Concentrate, Polydextrose, Fructose, Calcium Caseinate, Cocoa Powder,
Artificial Flavors, Canola Oil, High-Oleic Safflower Oil, Nonfat Dry Milk,
Whey Powder, Soy Lecithin and Corn Oil. Manufactured in a facility that
processes nuts.

Vitamins and Minerals:

Calcium Phosphate Tribasic, Potassium Phosphate Dibasic, Magnesium Oxide,
Salt (Sodium Chloride), Potassium Chloride, Ascorbic Acid, Ferric
Orthophosphate, Alpha-Tocopheryl Acetate, Niacinamide, Zinc Oxide, Calcium
Pantothenate, Copper Gluconate, Manganese Sulfate, Riboflavin,
Beta-Carotene, Pyridoxine Hydrochloride, Thiamine Mononitrate, Folic Acid,
Biotin, Chromium Chloride, Potassium Iodide, Sodium Selenate, Sodium
Molybdate, Phylloquinone, Vitamin D3 and Cyanocobalamin.

Protein:

Honey Graham Crunch - The protein source is a blend of soy protein isolate
and milk proteins.

Soy protein isolate 74%
Milk proteins 26%

Fat:

Honey Graham Crunch - The fat source is a blend of partially hydrogenated
cottonseed and soybean, canola, high-oleic safflower, and corn oils, and soy
lecithin.

Partially hydrogenated cottonseed and soybean oil 76%
Canola oil 8%
High-oleic safflower oil 8%
Corn oil 4%
Soy lecithin 4%

Carbohydrate:

Honey Graham Crunch - The carbohydrate source is a combination of
high-fructose corn syrup, brown sugar, maltodextrin, honey, crisp rice,
glycerine, soy polysaccharide, and oat bran.

High-fructose corn syrup 24%
Brown sugar 21%
Maltodextrin 12%
Honey 11%
Crisp rice 9%
Glycerine 9%
Soy polysaccharide 7%
Oat bran 7%
C. ENSURE® HIGH PROTEIN

Usage: ENSURE HIGH PROTEIN is a concentrated, high-protein liquid food
designed for people who require additional calories, protein, vitamins, and
minerals in their diets. It can be used as an oral nutritional supplement
with or between meals or, in appropriate amounts, as a meal replacement.
ENSURE HIGH PROTEIN is lactose- and gluten-free, and is suitable for use by
people recovering from general surgery or hip fractures and by patients at
risk for pressure ulcers.

Patient Conditions:

• For patients who require additional calories, protein, vitamins, and
minerals, such as patients recovering from general surgery or hip fractures,
patients at risk for pressure ulcers, and patients on low-cholesterol diets

Features:

• Low in saturated fat

• Contains 6 g of total fat and < 5 mg of cholesterol per serving

• Rich, creamy taste

• Excellent source of protein, calcium, and other essential vitamins and
minerals

• For low-cholesterol diets

• Lactose-free, easily digested

Ingredients:

Vanilla Supreme: ®-D Water, Sugar (Sucrose), Maltodextrin (Corn), Calcium
and Sodium Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy
Oil, Canola Oil, Potassium Citrate, Calcium Phosphate Tribasic, Sodium
Citrate, Magnesium Chloride, Magnesium Phosphate Dibasic, Artificial Flavor,
Sodium Chloride, Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan,
Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Gellan Gum,
Niacinamide, Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate,
Vitamin A Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine
Hydrochloride, Riboflavin, Folic Acid, Sodium Molybdate, Chromium Chloride,
Biotin, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D3 and
Cyanocobalamin.

Protein: 

The protein source is a blend of two high-biologic-value proteins: casein and
soy.

Sodium and calcium caseinates 85%
Soy protein isolate 15%

Fat: 

The fat source is a blend of three oils: high-oleic safflower, canola, and soy.

High-oleic safflower oil 40% 

Canola oil 30% 

Soy oil 30% 

The level of fat in ENSURE HIGH PROTEIN meets American Heart
Association (AHA) guidelines. The 6 grams of fat in ENSURE HIGH
PROTEIN represent 24% of the total calories, with 2.6% of the fat being from
saturated fatty acids and 7.9% from polyunsaturated fatty acids. These values
are within the AHA guidelines of < 30% of total calories from fat, < 10% of the
calories from saturated fatty acids, and < 10% of total calories from
polyunsaturated fatty acids.
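
The comparison against the AHA limits quoted above can be written out as a small check. This is only an illustrative sketch: the function name `within_aha_guidelines` and the dictionary layout are assumptions made for the example, while the limits and the product values are the ones stated in the text.

```python
# AHA limits quoted in the text, each as a percentage of total calories.
AHA_LIMITS = {"fat": 30.0, "saturated": 10.0, "polyunsaturated": 10.0}


def within_aha_guidelines(profile):
    """Return True if every value is strictly below its quoted AHA limit."""
    return all(profile[key] < limit for key, limit in AHA_LIMITS.items())


# Values stated above for ENSURE HIGH PROTEIN.
ensure_high_protein = {"fat": 24.0, "saturated": 2.6, "polyunsaturated": 7.9}
print(within_aha_guidelines(ensure_high_protein))
```

The same check can be applied to the figures given for the other products in this section by passing their stated percentages.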

Carbohydrate: 

ENSURE HIGH PROTEIN contains a combination of maltodextrin and
sucrose. The mild sweetness and flavor variety (vanilla supreme, chocolate
royal, wild berry, and banana), plus VARI-FLAVORS® Flavor Pacs in pecan,
cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in
patient compliance.

Vanilla and other nonchocolate flavors

Sucrose 60%
Maltodextrin 40%

Chocolate

Sucrose 70%
Maltodextrin 30%

D. ENSURE® LIGHT

Usage: ENSURE LIGHT is a low-fat liquid food designed for use as an 
oral nutritional supplement with or between meals. ENSURE LIGHT is 
lactose- and gluten-free, and is suitable for use in modified diets, including
low-cholesterol diets.

Patient Conditions: 

• For normal-weight or overweight patients who need extra nutrition in a 
supplement that contains 50% less fat and 20% fewer calories than ENSURE 

• For healthy adults who don't eat right and need extra nutrition

Features:

• Low in fat and saturated fat

• Contains 3 g of total fat per serving and < 5 mg cholesterol

• Rich, creamy taste

• Excellent source of calcium and other essential vitamins and minerals

• For low-cholesterol diets

• Lactose-free, easily digested

Ingredients:

French Vanilla: ®-D Water, Maltodextrin (Corn), Sugar (Sucrose), Calcium
Caseinate, High-Oleic Safflower Oil, Canola Oil, Magnesium Chloride, Sodium
Citrate, Potassium Citrate, Potassium Phosphate Dibasic, Magnesium Phosphate
Dibasic, Natural and Artificial Flavor, Calcium Phosphate Tribasic, Cellulose
Gel, Choline Chloride, Soy Lecithin, Carrageenan, Salt (Sodium Chloride),
Ascorbic Acid, Cellulose Gum, Ferrous Sulfate, Alpha-Tocopheryl Acetate,
Zinc Sulfate, Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric
Sulfate, Thiamine Chloride Hydrochloride, Vitamin A Palmitate, Pyridoxine
Hydrochloride, Riboflavin, Chromium Chloride, Folic Acid, Sodium
Molybdate, Biotin, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin
D3 and Cyanocobalamin.

Protein: 

The protein source is calcium caseinate. 

Calcium caseinate 100%

Fat:

The fat source is a blend of two oils: high-oleic safflower and canola.

High-oleic safflower oil 70%
Canola oil 30%

The level of fat in ENSURE LIGHT meets American Heart Association
(AHA) guidelines. The 3 grams of fat in ENSURE LIGHT represent 13.5% of
the total calories, with 1.4% of the fat being from saturated fatty acids and 2.6%
from polyunsaturated fatty acids. These values are within the AHA guidelines
of < 30% of total calories from fat, < 10% of the calories from saturated fatty
acids, and < 10% of total calories from polyunsaturated fatty acids.

Carbohydrate:

ENSURE LIGHT contains a combination of maltodextrin and sucrose.
The chocolate flavor contains corn syrup as well. The mild sweetness and
flavor variety (French vanilla, chocolate supreme, strawberry swirl), plus
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and
orange, help to prevent flavor fatigue and aid in patient compliance.

Vanilla and other nonchocolate flavors

Sucrose 51%
Maltodextrin 49%

Chocolate

Sucrose 47.0%
Corn Syrup 26.5%
Maltodextrin 26.5%

Vitamins and Minerals 

An 8-fl-oz serving of ENSURE LIGHT provides at least 25% of the 
RDIs for 24 key vitamins and minerals. 

Caffeine 

Chocolate flavor contains 2.1 mg caffeine/8 fl oz. 
E. ENSURE PLUS® 

Usage: ENSURE PLUS is a high-calorie, low-residue liquid food for 
use when extra calories and nutrients, but a normal concentration of protein, are 
needed. It is designed primarily as an oral nutritional supplement to be used 
with or between meals or, in appropriate amounts, as a meal replacement. 
ENSURE PLUS is lactose- and gluten-free. Although it is primarily an oral 
nutritional supplement, it can be fed by tube. 

Patient Conditions: 

• For patients who require extra calories and nutrients, but a normal 
concentration of protein, in a limited volume 

• For patients who need to gain or maintain healthy weight 
Features 

• Rich, creamy taste 

• Good source of essential vitamins and minerals 
Ingredients 

Vanilla: ®-D Water, Corn Syrup, Maltodextrin (Corn), Corn Oil, Sodium and
Calcium Caseinates, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride,
Potassium Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and
Artificial Flavor, Sodium Citrate, Potassium Chloride, Choline Chloride,
Ascorbic Acid, Carrageenan, Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl
Acetate, Niacinamide, Calcium Pantothenate, Manganese Sulfate, Cupric
Sulfate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride,
Riboflavin, Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride,
Sodium Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone,
Cyanocobalamin and Vitamin D3.

Protein 

The protein source is a blend of two high-biologic-value proteins: casein
and soy.

Sodium and calcium caseinates 84%
Soy protein isolate 16%

Fat

The fat source is corn oil.

Corn oil 100%

Carbohydrate

ENSURE PLUS contains a combination of maltodextrin and sucrose.
The mild sweetness and flavor variety (vanilla, chocolate, strawberry, coffee,
butter pecan, and eggnog), plus VARI-FLAVORS® Flavor Pacs in pecan,
cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in
patient compliance.

Vanilla, strawberry, butter pecan, and coffee flavors

Corn Syrup 39%
Maltodextrin 38%
Sucrose 23%

Chocolate and eggnog flavors

Corn Syrup 36%
Maltodextrin 34%
Sucrose 30%

Vitamins and Minerals 

An 8-fl-oz serving of ENSURE PLUS provides at least 15% of the RDIs
for 25 key vitamins and minerals.

Caffeine

Chocolate flavor contains 3.1 mg caffeine/8 fl oz. Coffee flavor
contains a trace amount of caffeine.

F. ENSURE PLUS® HN

Usage: ENSURE PLUS HN is a nutritionally complete high-calorie, 
high-nitrogen liquid food designed for people with higher calorie and protein 
needs or limited volume tolerance. It may be used for oral supplementation or 
for total nutritional support by tube. ENSURE PLUS HN is lactose- and
gluten-free.

Patient Conditions: 

• For patients with increased calorie and protein needs, such as following 
surgery or injury 

• For patients with limited volume tolerance and early satiety

Features

• For supplemental or total nutrition

• For oral or tube feeding

• 1.5 Cal/mL

• High nitrogen

• Calorically dense

Ingredients

Vanilla: ®-D Water, Maltodextrin (Corn), Sodium and Calcium Caseinates,
Corn Oil, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, Potassium
Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and Artificial
Flavor, Sodium Citrate, Choline Chloride, Ascorbic Acid, Taurine, L-Carnitine,
Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide,
Carrageenan, Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate,
Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, Riboflavin,
Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, Sodium
Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone,
Cyanocobalamin and Vitamin D3.



G. ENSURE® POWDER 

Usage: ENSURE POWDER (reconstituted with water) is a low-residue 
liquid food designed primarily as an oral nutritional supplement to be used with
or between meals. ENSURE POWDER is lactose- and gluten-free, and is
suitable for use in modified diets, including low-cholesterol diets.

Patient Conditions: 

• For patients on modified diets 

• For elderly patients at nutrition risk 

• For patients recovering from illness/surgery

• For patients who need a low-residue diet

Features

• Convenient, easy to mix

• Low in saturated fat

• Contains 9 g of total fat and < 5 mg of cholesterol per serving

• High in vitamins and minerals

• For low-cholesterol diets

• Lactose-free, easily digested

Ingredients: ®-D Corn Syrup, Maltodextrin (Corn), Sugar (Sucrose), Corn Oil,
Sodium and Calcium Caseinates, Soy Protein Isolate, Artificial Flavor,
Potassium Citrate, Magnesium Chloride, Sodium Citrate, Calcium Phosphate
Tribasic, Potassium Chloride, Soy Lecithin, Ascorbic Acid, Choline Chloride,
Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide,
Calcium Pantothenate, Manganese Sulfate, Thiamine Chloride Hydrochloride,
Cupric Sulfate, Pyridoxine Hydrochloride, Riboflavin, Vitamin A Palmitate,
Folic Acid, Biotin, Sodium Molybdate, Chromium Chloride, Potassium Iodide,
Sodium Selenate, Phylloquinone, Vitamin D3 and Cyanocobalamin.

Protein

The protein source is a blend of two high-biologic-value proteins: casein
and soy.

Sodium and calcium caseinates 84%
Soy protein isolate 16%

Fat

The fat source is corn oil.

Corn oil 100%

Carbohydrate

ENSURE POWDER contains a combination of corn syrup,
maltodextrin, and sucrose. The mild sweetness of ENSURE POWDER, plus
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and
orange, helps to prevent flavor fatigue and aid in patient compliance.

Vanilla

Corn Syrup 35%
Maltodextrin 35%
Sucrose 30%

H. ENSURE® PUDDING

Usage: ENSURE PUDDING is a nutrient-dense supplement providing 
balanced nutrition in a nonliquid form to be used with or between meals. It is 
appropriate for consistency-modified diets (e.g., soft, pureed, or full liquid) or
for people with swallowing impairments. ENSURE PUDDING is gluten-free.

Patient Conditions: 

• For patients on consistency-modified diets (e.g., soft, pureed, or full liquid) 

• For patients with swallowing impairments

Features

• Rich and creamy, good taste

• Good source of essential vitamins and minerals

• Convenient; needs no refrigeration

• Gluten-free

Nutrient Profile per 5 oz: Calories 250, Protein 10.9%, Total Fat 34.9%,
Carbohydrate 54.2%

Ingredients: 

Vanilla: ®-D Nonfat Milk, Water, Sugar (Sucrose), Partially Hydrogenated
Soybean Oil, Modified Food Starch, Magnesium Sulfate, Sodium Stearoyl
Lactylate, Sodium Phosphate Dibasic, Artificial Flavor, Ascorbic Acid, Zinc
Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Choline Chloride,
Niacinamide, Manganese Sulfate, Calcium Pantothenate, FD&C Yellow #5,
Potassium Citrate, Cupric Sulfate, Vitamin A Palmitate, Thiamine Chloride
Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, FD&C Yellow #6, Folic
Acid, Biotin, Phylloquinone, Vitamin D3 and Cyanocobalamin.

Protein

The protein source is nonfat milk.

Nonfat milk 100%

Fat

The fat source is hydrogenated soybean oil.

Hydrogenated soybean oil 100%

Carbohydrate

ENSURE PUDDING contains a combination of sucrose and modified
food starch. The mild sweetness and flavor variety (vanilla, chocolate,
butterscotch, and tapioca) help prevent flavor fatigue. The product contains 9.2
grams of lactose per serving.

Vanilla and other nonchocolate flavors

Sucrose 56%
Lactose 27%
Modified food starch 17%

Chocolate

Sucrose 58%
Lactose 26%
Modified food starch 16%

I. ENSURE® WITH FIBER

Usage: ENSURE WITH FIBER is a fiber-containing, nutritionally
complete liquid food designed for people who can benefit from increased
dietary fiber and nutrients. ENSURE WITH FIBER is suitable for people who
do not require a low-residue diet. It can be fed orally or by tube, and can be
used as a nutritional supplement to a regular diet or, in appropriate amounts, as
a meal replacement. ENSURE WITH FIBER is lactose- and gluten-free, and is
suitable for use in modified diets, including low-cholesterol diets.

Patient Conditions 

• For patients who can benefit from increased dietary fiber and nutrients 
Features

• New advanced formula: low in saturated fat, higher in vitamins and minerals

• Contains 6 g of total fat and < 5 mg of cholesterol per serving

• Rich, creamy taste

• Good source of fiber

• Excellent source of essential vitamins and minerals

• For low-cholesterol diets

• Lactose- and gluten-free

Ingredients

Vanilla: ®-D Water, Maltodextrin (Corn), Sugar (Sucrose), Sodium and 
Calcium Caseinates, Oat Fiber, High-Oleic Safflower Oil, Canola Oil, Soy 
Protein Isolate, Corn Oil, Soy Fiber, Calcium Phosphate Tribasic, Magnesium 
Chloride, Potassium Citrate, Cellulose Gel, Soy Lecithin, Potassium Phosphate 
Dibasic, Sodium Citrate, Natural and Artificial Flavors, Choline Chloride, 
Magnesium Phosphate, Ascorbic Acid, Cellulose Gum, Potassium Chloride, 
Carrageenan, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Zinc Sulfate, 
Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric Sulfate, 
Vitamin A Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine 
Hydrochloride, Riboflavin, Folic Acid, Chromium Chloride, Biotin, Sodium 
Molybdate, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D3 and 
Cyanocobalamin. 



Protein

The protein source is a blend of two high-biologic-value proteins: casein
and soy.

Sodium and calcium caseinates 80%
Soy protein isolate 20%

Fat

The fat source is a blend of three oils: high-oleic safflower, canola, and
corn.

High-oleic safflower oil 40%
Canola oil 40%
Corn oil 20%



The level of fat in ENSURE WITH FIBER meets American Heart
Association (AHA) guidelines. The 6 grams of fat in ENSURE WITH FIBER
represent 22% of the total calories, with 2.01% of the fat being from saturated
fatty acids and 6.7% from polyunsaturated fatty acids. These values are within
the AHA guidelines of < 30% of total calories from fat, < 10% of the calories
from saturated fatty acids, and < 10% of total calories from polyunsaturated
fatty acids.

Carbohydrate

ENSURE WITH FIBER contains a combination of maltodextrin and
sucrose. The mild sweetness and flavor variety (vanilla, chocolate, and butter
pecan), plus VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry,
lemon, and orange, help to prevent flavor fatigue and aid in patient compliance.



Vanilla and other nonchocolate flavors

Maltodextrin 66%
Sucrose 25%
Oat Fiber 7%
Soy Fiber 2%

Chocolate

Maltodextrin 55%
Sucrose 36%
Oat Fiber 7%
Soy Fiber 2%

Fiber

The fiber blend used in ENSURE WITH FIBER consists of oat fiber and
soy polysaccharide. This blend results in approximately 4 grams of total dietary
fiber per 8-fl-oz can. The ratio of insoluble to soluble fiber is 95:5.

The various nutritional supplements described above and known to 
others of skill in the art can be substituted and/or supplemented with the PUFAs 
of this invention. 

J. Oxepa™ Nutritional Product 

Oxepa is a low-carbohydrate, calorically dense enteral nutritional product
designed for the dietary management of patients with or at risk for ARDS. It
has a unique combination of ingredients, including a patented oil blend
containing eicosapentaenoic acid (EPA from fish oil), γ-linolenic acid (GLA
from borage oil), and elevated antioxidant levels.

Caloric Distribution:

• Caloric density is high at 1.5 Cal/mL (355 Cal/8 fl oz), to minimize the 
volume required to meet energy needs. 
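
The relationship between the stated caloric density and the calories per serving can be checked with a short calculation. This is a sketch only: the function name is hypothetical, and the conversion factor of 29.5735 mL per US fluid ounce is an assumption about which fluid ounce the text uses.

```python
# Assumed conversion: one US customary fluid ounce in millilitres.
ML_PER_US_FL_OZ = 29.5735


def calories_per_serving(density_cal_per_ml, volume_fl_oz):
    """Calories delivered by a serving of the given volume."""
    return density_cal_per_ml * volume_fl_oz * ML_PER_US_FL_OZ


# Density and serving size stated in the text: 1.5 Cal/mL and 8 fl oz.
serving_calories = calories_per_serving(1.5, 8)
print(round(serving_calories))  # agrees with the 355 Cal/8 fl oz stated above
```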



The distribution of Calories in Oxepa is shown in Table 7.

Table 7. Caloric Distribution of Oxepa

                    per 8 fl oz    per liter    % of Cal
Calories            355            1,500
Fat (g)             22.2           93.7         55.2
Carbohydrate (g)    25             105.5        28.1
Protein (g)         14.8           62.5         16.7
Water (g)           186            785

Fat:

• Oxepa contains 22.2 g of fat per 8-fl-oz serving (93.7 g/L).

• The fat source is an oil blend of 31.8% canola oil, 25% medium-chain
triglycerides (MCTs), 20% borage oil, 20% fish oil, and 3.2% soy lecithin. The
typical fatty acid profile of Oxepa is shown in Table 8.

• Oxepa provides a balanced amount of polyunsaturated, monounsaturated,
and saturated fatty acids, as shown in Table 10.

• Medium-chain triglycerides (MCTs), which make up 25% of the fat blend, aid
gastric emptying because they are absorbed by the intestinal tract without
emulsification by bile acids.

The various fatty acid components of Oxepa™ nutritional product can 
be substituted and/or supplemented with the PUFAs of this invention. 
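
As an illustration of how the oil blend translates into grams per serving, the stated percentages can be applied to the 22.2 g of fat per 8-fl-oz serving. The sketch below is not part of the specification: the names `OIL_BLEND_PCT` and `grams_per_serving` are hypothetical, and the calculation assumes the blend percentages are by weight of total fat.

```python
# Fat per 8-fl-oz serving and oil blend percentages as stated in the text.
FAT_PER_SERVING_G = 22.2
OIL_BLEND_PCT = {
    "canola oil": 31.8,
    "medium-chain triglycerides": 25.0,
    "borage oil": 20.0,
    "fish oil": 20.0,
    "soy lecithin": 3.2,
}


def grams_per_serving(total_fat_g, blend_pct):
    """Split total fat into grams of each component (assumes weight percent)."""
    return {name: total_fat_g * pct / 100.0 for name, pct in blend_pct.items()}


blend_grams = grams_per_serving(FAT_PER_SERVING_G, OIL_BLEND_PCT)
for name, grams in blend_grams.items():
    print(f"{name}: {grams:.2f} g")
```

Under that assumption the five components account for the whole fat content, since the stated percentages add up to 100%.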



Table 8. Typical Fatty Acid Profile

                                 % Total Fatty Acids    g/8 fl oz*    g/L*
Caproic (6:0)                    0.2                    0.04          0.18
Caprylic (8:0)                   14.69                  3.1           13.07
Capric (10:0)                    11.06                  2.33          9.87
Palmitic (16:0)                  5.59                   1.18          4.98
Palmitoleic (16:1n-7)            1.82                   0.38          1.62
Stearic (18:0)                   1.84                   0.39          1.64
Oleic (18:1n-9)                  24.44                  5.16          21.75
Linoleic (18:2n-6)               16.28                  3.44          14.49
α-Linolenic (18:3n-3)            3.47                   0.73          3.09
γ-Linolenic (18:3n-6)            4.82                   1.02          4.29
Eicosapentaenoic (20:5n-3)       5.11                   1.08          4.55
n-3-Docosapentaenoic (22:5n-3)   0.55                   0.12          0.49
Docosahexaenoic (22:6n-3)        2.27                   0.48          2.02
Others                           7.55                   1.52          6.72

* Fatty acids equal approximately 95% of total fat.
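
The g/L column of Table 8 appears to follow from the percentage column, the 93.7 g/L of total fat, and the footnote that fatty acids are approximately 95% of total fat. The following sketch reproduces a few rows under that reading; the function name is hypothetical and the 0.95 factor is taken from the footnote as an approximation, so small rounding differences are expected.

```python
TOTAL_FAT_G_PER_L = 93.7      # stated fat content of Oxepa per liter
FATTY_ACID_FRACTION = 0.95    # from the footnote: approximately 95% of total fat


def fatty_acid_g_per_l(pct_of_total_fatty_acids):
    """Estimate g/L of one fatty acid from its share of total fatty acids."""
    total_fatty_acids = TOTAL_FAT_G_PER_L * FATTY_ACID_FRACTION
    return pct_of_total_fatty_acids / 100.0 * total_fatty_acids


# (percent of total fatty acids, g/L listed in Table 8)
rows = {
    "oleic": (24.44, 21.75),
    "linoleic": (16.28, 14.49),
    "eicosapentaenoic": (5.11, 4.55),
}
for name, (pct, listed) in rows.items():
    print(f"{name}: estimated {fatty_acid_g_per_l(pct):.2f} g/L, listed {listed} g/L")
```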


Table 9. Fat Profile of Oxepa

% of total calories from fat    55.2
Polyunsaturated fatty acids     31.44 g/L
Monounsaturated fatty acids     25.53 g/L
Saturated fatty acids           32.38 g/L
n-6 to n-3 ratio                1.75:1
Cholesterol                     9.49 mg/8 fl oz; 40.1 mg/L

Carbohydrate:

• The carbohydrate content is 25.0 g per 8-fl-oz serving (105.5 g/L).

• The carbohydrate sources are 45% maltodextrin (a complex carbohydrate)
and 55% sucrose (a simple sugar), both of which are readily digested and
absorbed.

• The high-fat and low-carbohydrate content of Oxepa is designed to
minimize carbon dioxide (CO2) production. High CO2 levels can complicate
weaning in ventilator-dependent patients. The low level of carbohydrate also
may be useful for those patients who have developed stress-induced
hyperglycemia.

• Oxepa is lactose-free.

Dietary carbohydrate, the amino acids from protein, and the glycerol
moiety of fats can be converted to glucose within the body. Throughout this
process, the carbohydrate requirements of glucose-dependent tissues (such as
the central nervous system and red blood cells) are met. However, a diet free of
carbohydrates can lead to ketosis, excessive catabolism of tissue protein, and
loss of fluid and electrolytes. These effects can be prevented by daily ingestion
of 50 to 100 g of digestible carbohydrate, if caloric intake is adequate. The
carbohydrate level in Oxepa is also sufficient to minimize gluconeogenesis, if
energy needs are being met.

Protein: 

• Oxepa contains 14.8 g of protein per 8-fl-oz serving (62.5 g/L). 

• The total calorie/nitrogen ratio (150:1) meets the needs of stressed patients.

• Oxepa provides enough protein to promote anabolism and the maintenance
of lean body mass without precipitating respiratory problems. High protein
intakes are a concern in patients with respiratory insufficiency. Although
protein has little effect on CO2 production, a high-protein diet will increase
ventilatory drive.






• The protein sources of Oxepa are 86.8% sodium caseinate and 13.2%
calcium caseinate.

• As demonstrated in Table 11, the amino acid profile of the protein system in
Oxepa meets or surpasses the standard for high quality protein set by
the National Academy of Sciences.

• Oxepa is gluten-free. 
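
The stated total calorie/nitrogen ratio of 150:1 is consistent with the per-liter figures given for Oxepa if nitrogen is estimated from protein with the conventional factor of 6.25 g protein per g nitrogen. That factor is an assumption introduced for this sketch (the text does not say how nitrogen was calculated), and the function name is hypothetical.

```python
# Conventional conversion (assumption): 6.25 g of protein per g of nitrogen.
PROTEIN_G_PER_G_NITROGEN = 6.25


def calorie_to_nitrogen_ratio(calories, protein_g):
    """Total calories divided by grams of nitrogen supplied by the protein."""
    nitrogen_g = protein_g / PROTEIN_G_PER_G_NITROGEN
    return calories / nitrogen_g


# Per-liter values stated for Oxepa: 1,500 Cal and 62.5 g protein.
ratio = calorie_to_nitrogen_ratio(1500, 62.5)
print(f"{ratio:.0f}:1")
```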

All publications and patent applications mentioned in this specification
are indicative of the level of skill of those skilled in the art to which this
invention pertains. All publications and patent applications are herein
incorporated by reference to the same extent as if each individual publication or
patent application was specifically and individually indicated to be incorporated
by reference.

The invention now being fully described, it will be apparent to one of
ordinary skill in the art that many changes and modifications can be made
thereto without departing from the spirit or scope of the appended claims.
SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: KNUTZON, DEBORAH
MURKERJI, PRADIP
HUANG, YUNG-SHENG
THURMOND, JENNIFER
CHAUDHARY, SUNITA
LEONARD, AMANDA

(ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR SYNTHESIS
OF LONG CHAIN POLY-UNSATURATED FATTY ACIDS

(iii) NUMBER OF SEQUENCES: 40 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LIMBACH AND LIMBACH LLP 

(B) STREET: 2001 FERRY BUILDING 

(C) CITY: SAN FRANCISCO 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94111 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: Microsoft Word 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE:

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: WARD, MICHAEL R.

(B) REGISTRATION NUMBER: 38,651

(C) REFERENCE/DOCKET NUMBER: CGAB-210

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 433-4150 

(B) TELEFAX: (415) 433-8716 

(C) TELEX: N/A 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1617 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
CGACACTCCT TCCTTCTTCT CACCCGTCCT AGTCCCCTTC AACCCCCCTC TTTGACAAAG 60
ACAACAAACC ATGGCTGCTG CTCCCAGTGT GAGGACGTTT ACTCGGGCCG AGGTTTTGAA 120
TGCCGAGGCT CTGAATGAGG GCAAGAAGGA TGCCGAGGCA CCCTTCTTGA TGATCATCGA 180
CAACAAGGTG TACGATGTCC GCGAGTTCGT CCCTGATCAT CCCGGTGGAA GTGTGATTCT 240
CACGCACGTT GGCAAGGACG GCACTGACGT CTTTGACACT TTTCACCCCG AGGCTGCTTG 300
GGAGACTCTT GCCAACTTTT ACGTTGGTGA TATTGACGAG AGCGACCGCG ATATCAAGAA 360
TGATGACTTT GCGGCCGAGG TCCGCAAGCT GCGTACCTTG TTCCAGTCTC TTGGTTACTA 420
CGATTCTTCC AAGGCATACT ACGCCTTCAA GGTCTCGTTC AACCTCTGCA TCTGGGGTTT 480
GTCGACGGTC ATTGTGGCCA AGTGGGGCCA GACCTCGACC CTCGCCAACG TGCTCTCGGC 540
TGCGCTTTTG GGTCTGTTCT GGCAGCAGTG CGGATGGTTG GCTCACGACT TTTTGCATCA 600
CCAGGTCTTC CAGGACCGTT TCTGGGGTGA TCTTTTCGGC GCCTTCTTGG GAGGTGTCTG 660
CCAGGGCTTC TCGTCCTCGT GGTGGAAGGA CAAGCACAAC ACTCACCACG CCGCCCCCAA 720
CGTCCACGGC GAGGATCCCG ACATTGACAC CCACCCTCTG TTGACCTGGA GTGAGCATGC 780
GTTGGAGATG TTCTCGGATG TCCCAGATGA GGAGCTGACC CGCATGTGGT CGCGTTTCAT 840
GGTCCTGAAC CAGACCTGGT TTTACTTCCC CATTCTCTCG TTTGCCCGTC TCTCCTGGTG 900
CCTCCAGTCC ATTCTCTTTG TGCTGCCTAA CGGTCAGGCC CACAAGCCCT CGGGCGCGCG 960
TGTGCCCATC TCGTTGGTCG AGCAGCTGTC GCTTGCGATG CACTGGACCT GGTACCTCGC 1020
CACCATGTTC CTGTTCATCA AGGATCCCGT CAACATGCTG GTGTACTTTT TGGTGTCGCA 1080
GGCGGTGTGC GGAAACTTGT TGGCGATCGT GTTCTCGCTC AACCACAACG GTATGCCTGT 1140
GATCTCGAAG GAGGAGGCGG TCGATATGGA TTTCTTCACG AAGCAGATCA TCACGGGTCG 1200
TGATGTCCAC CCGGGTCTAT TTGCCAACTG GTTCACGGGT GGATTGAACT ATCAGATCGA 1260
GCACCACTTG TTCCCTTCGA TGCCTCGCCA CAACTTTTCA AAGATCCAGC CTGCTGTCGA 1320
GACCCTGTGC AAAAAGTACA ATGTCCGATA CCACACCACC GGTATGATCG AGGGAACTGC 1380
AGAGGTCTTT AGCCGTCTGA ACGAGGTCTC CAAGGCTGCC TCCAAGATGG GTAAGGCGCA 1440
GTAAAAAAAA AAACAAGGAC GTTTTTTTTC GCCAGTGCCT GTGCCTGTGC CTGCTTCCCT 1500
TGTCAAGTCG AGCGTTTCTG GAAAGGATCG TTCAGTGCAG TATCATCATT CTCCTTTTAC 1560
CCCCCGCTCA TATCTCATTC ATTTCTCTTA TTAAACAACT TGTTCCCCCC TTCACCG 1617

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 457 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu
1 5 10 15

Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe
20 25 30

Leu Met Ile Ile Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro
35 40 45

Asp His Pro Gly Gly Ser Val Ile Leu Thr His Val Gly Lys Asp Gly
50 55 60

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu
65 70 75 80

Ala Asn Phe Tyr Val Gly Asp Ile Asp Glu Ser Asp Arg Asp Ile Lys
85 90 95

Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gln
100 105 110

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val
115 120 125

Ser Phe Asn Leu Cys Ile Trp Gly Leu Ser Thr Val Ile Val Ala Lys
130 135 140

Trp Gly Gln Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu
145 150 155 160

Gly Leu Phe Trp Gln Gln Cys Gly Trp Leu Ala His Asp Phe Leu His
165 170 175

His Gln Val Phe Gln Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe
180 185 190

Leu Gly Gly Val Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys
195 200 205

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp
210 215 220

Ile Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met
225 230 235 240

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe
245 250 255

Met Val Leu Asn Gln Thr Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala
260 265 270

Arg Leu Ser Trp Cys Leu Gln Ser Ile Leu Phe Val Leu Pro Asn Gly
275 280 285

Gln Ala His Lys Pro Ser Gly Ala Arg Val Pro Ile Ser Leu Val Glu
290 295 300

Gln Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe
305 310 315 320

Leu Phe Ile Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser
325 330 335

Gln Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His
340 345 350

Asn Gly Met Pro Val Ile Ser Lys Glu Glu Ala Val Asp Met Asp Phe
355 360 365

Phe Thr Lys Gln Ile Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe
370 375 380

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
385 390 395 400

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gln Pro Ala Val
405 410 415

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met
420 425 430

Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys
435 440 445

Ala Ala Ser Lys Met Gly Lys Ala Gln
450 455
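
The correspondence between the nucleotide listing of SEQ ID NO:1 and the amino acid listing of SEQ ID NO:2 can be spot-checked by translating the start of the open reading frame. The sketch below assumes the standard genetic code and assumes the reading frame begins at the first ATG of SEQ ID NO:1 (nucleotide 71); the 48-nucleotide fragment is copied from that listing, and the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G ("*" = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA (5'->3', frame 1) to one-letter amino acids up to a stop."""
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 nucleotides of the assumed reading frame of SEQ ID NO:1.
ORF_START = "ATGGCTGCTG CTCCCAGTGT GAGGACGTTT ACTCGGGCCG AGGTTTTG"
print(translate(ORF_START))  # MAAAPSVRTFTRAEVL
```

In three-letter code this is Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu, which matches residues 1 to 16 of SEQ ID NO:2.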

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1488 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



GTCCCCTGTC GCTGTCGGCA CACCCCATCC TCCCTCGCTC CCTCTGCGTT TGTCCTTGGC 60
CCACCGTCTC TCCTCCACCC TCCGAGACGA CTGCAACTGT AATCAGGAAC CGACAAATAC 120
ACGATTTCTT TTTACTCAGC ACCAACTCAA AATCCTCAAC CGCAACCCTT TTTCAGGATG 180
GCACCTCCCA ACACTATCGA TGCCGGTTTG ACCCAGCGTC ATATCAGCAC CTCGGCCCCA 240
AACTCGGCCA AGCCTGCCTT CGAGCGCAAC TACCAGCTCC CCGAGTTCAC CATCAAGGAG 300
ATCCGAGAGT GCATCCCTGC CCACTGCTTT GAGCGCTCCG GTCTCCGTGG TCTCTGCCAC 360
GTTGCCATCG ATCTGACTTG GGCGTCGCTC TTGTTCCTGG CTGCGACCCA GATCGACAAG 420
TTTGAGAATC CCTTGATCCG CTATTTGGCC TGGCCTGTTT ACTGGATCAT GCAGGGTATT 480
GTCTGCACCG GTGTCTGGGT GCTGGCTCAC GAGTGTGGTC ATCAGTCCTT CTCGACCTCC 540
AAGACCCTCA ACAACACAGT TGGTTGGATC TTGCACTCGA TGCTCTTGGT CCCCTACCAC 600
TCCTGGAGAA TCTCGCACTC GAAGCACCAC AAGGCCACTG GCCATATGAC CAAGGACCAG 660
GTCTTTGTGC CCAAGACCCG CTCCCAGGTT GGCTTGCCTC CCAAGGAGAA CGCTGCTGCT 720
GCCGTTCAGG AGGAGGACAT GTCCGTGCAC CTGGATGAGG AGGCTCCCAT TGTGACTTTG 780
TTCTGGATGG TGATCCAGTT CTTGTTCGGA TGGCCCGCGT ACCTGATTAT GAACGCCTCT 840
GGCCAAGACT ACGGCCGCTG GACCTCGCAC TTCCACACGT ACTCGCCCAT CTTTGAGCCC 900
CGCAACTTTT TCGACATTAT TATCTCGGAC CTCGGTGTGT TGGCTGCCCT CGGTGCCCTG 960
ATCTATGCCT CCATGCAGTT GTCGCTCTTG ACCGTCACCA AGTACTATAT TGTCCCCTAC 1020
CTCTTTGTCA ACTTTTGGTT GGTCCTGATC ACCTTCTTGC AGCACACCGA TCCCAAGCTG 1080
CCCCATTACC GCGAGGGTGC CTGGAATTTC CAGCGTGGAG CTCTTTGCAC CGTTGACCGC 1140
TCGTTTGGCA AGTTCTTGGA CCATATGTTC CACGGCATTG TCCACACCCA TGTGGCCCAT 1200
CACTTGTTCT CGCAAATGCC GTTCTACCAT GCTGAGGAAG CTACCTATCA TCTCAAGAAA 1260
CTGCTGGGAG AGTACTATGT GTACGACCCA TCCCCGATCG TCGTTGCGGT CTGGAGGTCG 1320
TTCCGTGAGT GCCGATTCGT GGAGGATCAG GGAGACGTGG TCTTTTTCAA GAAGTAAAAA 1380
AAAAGACAAT GGACCACACA CAACCTTGTC TCTACAGACC TACGTATCAT GTAGCCATAC 1440
CACTTCATAA AAGAACATGA GCTCTAGAGG CGTGTCATTC GCGCCTCC 1488

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Pro Pro Asn Thr Ile Asp Ala Gly Leu Thr Gln Arg His Ile
1 5 10 15

Ser Thr Ser Ala Pro Asn Ser Ala Lys Pro Ala Phe Glu Arg Asn Tyr
20 25 30

Gln Leu Pro Glu Phe Thr Ile Lys Glu Ile Arg Glu Cys Ile Pro Ala
35 40 45

His Cys Phe Glu Arg Ser Gly Leu Arg Gly Leu Cys His Val Ala Ile
50 55 60

Asp Leu Thr Trp Ala Ser Leu Leu Phe Leu Ala Ala Thr Gln Ile Asp
65 70 75 80

Lys Phe Glu Asn Pro Leu Ile Arg Tyr Leu Ala Trp Pro Val Tyr Trp
85 90 95

Ile Met Gln Gly Ile Val Cys Thr Gly Val Trp Val Leu Ala His Glu
100 105 110

Cys Gly His Gln Ser Phe Ser Thr Ser Lys Thr Leu Asn Asn Thr Val
115 120 125

Gly Trp Ile Leu His Ser Met Leu Leu Val Pro Tyr His Ser Trp Arg
130 135 140

Ile Ser His Ser Lys His His Lys Ala Thr Gly His Met Thr Lys Asp
145 150 155 160

Gln Val Phe Val Pro Lys Thr Arg Ser Gln Val Gly Leu Pro Pro Lys
165 170 175

Glu Asn Ala Ala Ala Ala Val Gln Glu Glu Asp Met Ser Val His Leu
180 185 190

Asp Glu Glu Ala Pro Ile Val Thr Leu Phe Trp Met Val Ile Gln Phe
195 200 205

Leu Phe Gly Trp Pro Ala Tyr Leu Ile Met Asn Ala Ser Gly Gln Asp
210 215 220

Tyr Gly Arg Trp Thr Ser His Phe His Thr Tyr Ser Pro Ile Phe Glu
225 230 235 240

Pro Arg Asn Phe Phe Asp Ile Ile Ile Ser Asp Leu Gly Val Leu Ala
245 250 255

Ala Leu Gly Ala Leu Ile Tyr Ala Ser Met Gln Leu Ser Leu Leu Thr
260 265 270

Val Thr Lys Tyr Tyr Ile Val Pro Tyr Leu Phe Val Asn Phe Trp Leu
275 280 285

Val Leu Ile Thr Phe Leu Gln His Thr Asp Pro Lys Leu Pro His Tyr
290 295 300

Arg Glu Gly Ala Trp Asn Phe Gln Arg Gly Ala Leu Cys Thr Val Asp
305 310 315 320

Arg Ser Phe Gly Lys Phe Leu Asp His Met Phe His Gly Ile Val His
325 330 335

Thr His Val Ala His His Leu Phe Ser Gln Met Pro Phe Tyr His Ala
340 345 350

Glu Glu Ala Thr Tyr His Leu Lys Lys Leu Leu Gly Glu Tyr Tyr Val
355 360 365

Tyr Asp Pro Ser Pro Ile Val Val Ala Val Trp Arg Ser Phe Arg Glu
370 375 380

Cys Arg Phe Val Glu Asp Gln Gly Asp Val Val Phe Phe Lys Lys
385 390 395

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 355 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Glu Val Arg Lys Leu Arg Thr Leu Phe Gln Ser Leu Gly Tyr Tyr Asp
1 5 10 15

Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val Ser Phe Asn Leu Cys Ile
20 25 30

Trp Gly Leu Ser Thr Val Ile Val Ala Lys Trp Gly Gln Thr Ser Thr
35 40 45

Leu Ala Asn Val Leu Ser Ala Ala Leu Leu Gly Leu Phe Trp Gln Gln
50 55 60

Cys Gly Trp Leu Ala His Asp Phe Leu His His Gln Val Phe Gln Asp
65 70 75 80

Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe Leu Gly Gly Val Cys Gln
85 90 95

Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys His Asn Thr His His Ala
100 105 110

Ala Pro Asn Val His Gly Glu Asp Pro Asp Ile Asp Thr His Pro Leu
115 120 125

Leu Thr Trp Ser Glu His Ala Leu Glu Met Phe Ser Asp Val Pro Asp
130 135 140

Glu Glu Leu Thr Arg Met Trp Ser Arg Phe Met Val Leu Asn Gln Thr
145 150 155 160

Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala Arg Leu Ser Trp Cys Leu
165 170 175

Gln Ser Ile Leu Phe Val Leu Pro Asn Gly Gln Ala His Lys Pro Ser
180 185 190

Gly Ala Arg Val Pro Ile Ser Leu Val Glu Gln Leu Ser Leu Ala Met
195 200 205

His Trp Thr Trp Tyr Leu Ala Thr Met Phe Leu Phe Ile Lys Asp Pro
210 215 220

Val Asn Met Leu Val Tyr Phe Leu Val Ser Gln Ala Val Cys Gly Asn
225 230 235 240

Leu Leu Ala Ile Val Phe Ser Leu Asn His Asn Gly Met Pro Val Ile
245 250 255

Ser Lys Glu Glu Ala Val Asp Met Asp Phe Phe Thr Lys Gln Ile Ile
260 265 270

Thr Gly Arg Asp Val His Pro Gly Leu Phe Ala Asn Trp Phe Thr Gly
275 280 285

Gly Leu Asn Tyr Gln Ile Glu His His Leu Phe Pro Ser Met Pro Arg
290 295 300

His Asn Phe Ser Lys Ile Gln Pro Ala Val Glu Thr Leu Cys Lys Lys
305 310 315 320

Tyr Asn Val Arg Tyr His Thr Thr Gly Met Ile Glu Gly Thr Ala Glu
325 330 335

Val Phe Ser Arg Leu Asn Glu Val Ser Lys Ala Ala Ser Lys Met Gly
340 345 350

Lys Ala Gln
355



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Val Thr Leu Tyr Thr Leu Ala Phe Val Ala Ala Asn Ser Leu Gly Val 
1 5 10 15 

Leu Tyr Gly Val Leu Ala Cys Pro Ser Val Xaa Pro His Gln Ile Ala
20 25 30

Ala Gly Leu Leu Gly Leu Leu Trp Ile Gln Ser Ala Tyr Ile Gly Xaa
35 40 45

Asp Ser Gly His Tyr Val Ile Met Ser Asn Lys Ser Asn Asn Xaa Phe
50 55 60

Ala Gln Leu Leu Ser Gly Asn Cys Leu Thr Gly Ile Ile Ala Trp Trp
65 70 75 80

Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp Tyr
85 90 95

Gly Pro Asn Leu Gln His Ile Pro
100 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gly Val Leu Tyr Gly Val Leu Ala Cys Thr Ser Val Phe Ala His Gln
1 5 10 15

Ile Ala Ala Ala Leu Leu Gly Leu Leu Trp Ile Gln Ser Ala Tyr Ile
20 25 30

Gly His Asp Ser Gly His Tyr Val Ile Met Ser Asn Lys Ser Tyr Asn
35 40 45

Arg Phe Ala Gln Leu Leu Ser Gly Asn Cys Leu Thr Gly Ile Ser Ile
50 55 60

Ala Trp Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser
65 70 75 80

Leu Asp Tyr Asp Pro Asp Leu Gln His Ile Pro Val Phe Ala Val Ser
85 90 95

Thr Lys Phe Phe Ser Ser Leu Thr Ser Arg Phe Tyr Asp Arg Lys Leu
100 105 110

Thr Phe Gly Pro Val Ala Arg Phe Leu Val Ser Tyr Gln His Phe Thr
115 120 125

Tyr Tyr Pro Val Asn Cys Phe Gly Arg Ile Asn Leu Phe Ile Gln Thr
130 135 140

Phe Leu Leu Leu Phe Ser Lys Arg Glu Val Pro Asp Arg Ala Leu Asn
145 150 155 160

Phe Ala Gly Ile Leu Val Phe Trp Thr Trp Phe Pro Leu Leu Val Ser
165 170 175

Cys Leu Pro Asn Trp Pro Glu Arg Phe Phe Phe Val Phe Thr Ser Phe
180 185 190

Thr Val Thr Ala Leu Gln His Ile Gln Phe Thr Leu Asn His Phe Ala
195 200 205

Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp Trp Phe Glu Lys
210 215 220

Gln Ala Ala Gly Thr Ile Asp Ile Ser Cys Arg Ser Tyr Met Asp Trp
225 230 235 240

Phe Phe Gly Gly Leu Gln Phe Gln Leu Glu His His
245 250 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gly Xaa Xaa Asn Phe Ala Gly Ile Leu Val Phe Trp Thr Trp Phe Pro
1 5 10 15

Leu Leu Val Ser Cys Leu Pro Asn Trp Pro Glu Arg Phe Xaa Phe Val
20 25 30

Phe Thr Gly Phe Thr Val Thr Ala Leu Gln His Ile Gln Phe Thr Leu
35 40 45

Asn His Phe Ala Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp
50 55 60

Trp Phe Glu Lys Gln Ala Ala Gly Thr Ile Asp Ile Ser Cys Arg Ser
65 70 75 80

Tyr Met Asp Trp Phe Phe Cys Gly Leu Gln Phe Gln Leu Glu His His
85 90 95

Leu Phe Pro Arg Leu Pro Arg Cys His Leu Arg Lys Val Ser Pro Val
100 105 110

Gly Gln Arg Gly Phe Gln Arg Lys Xaa Asn Leu Ser Xaa
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Pro Ala Thr Glu Val Gly Gly Leu Ala Trp Met Ile Thr Phe Tyr Val
1 5 10 15

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu
20 25 30

Gly Leu Phe Phe Ile Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp
35 40 45

Val Thr Gln Met Asn His Ile Pro Met His Ile Asp His Asp Arg Asn
50 55 60

Met Asp Trp Val Ser Thr Gln Leu Gln Ala Thr Cys Asn Val His Lys
65 70 75 80

Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu
85 90 95

His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Xaa Val Ala
100 105 110

Pro Leu Val Gln Ser Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Ser
115 120 125 

Lys Pro Leu 
130 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Ser Pro Lys Ser Ser Pro Thr Arg Asn Met Thr Pro Ser Pro Phe 
1 5 10 15

Ile Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
20 25 30

Phe Pro Thr Met Pro Arg Cys Asn Leu Asn Arg Cys Met Lys Tyr Val
35 40 45

Lys Glu Trp Cys Ala Glu Asn Asn Leu Pro Tyr Leu Val Asp Asp Tyr
50 55 60

Phe Val Gly Tyr Asn Leu Asn Leu Gln Gln Leu Lys Asn Met Ala Glu
65 70 75 80

Leu Val Gln Ala Lys Ala Ala
85 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Arg His Glu Ala Ala Arg Gly Gly Thr Arg Leu Ala Tyr Met Leu Val 
1 5 10 15 

Cys Met Gln Trp Thr Asp Leu Leu Trp Ala Ala Ser Phe Tyr Ser Arg
20 25 30

Phe Phe Leu Ser Tyr Ser Pro Phe Tyr Gly Ala Thr Gly Thr Leu Leu
35 40 45

Leu Phe Val Ala Val Arg Val Leu Glu Ser His Trp Phe Val Trp Ile
50 55 60

Thr Gln Met Asn His Ile Pro Lys Glu Ile Gly His Glu Lys His Arg
65 70 75 80

Asp Trp Ala Ser Ser Gln Leu Ala Ala Thr Cys Asn Val Glu Pro Ser
85 90 95

Leu Phe Ile Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu His
100 105 110

His Leu Phe Pro Thr Met Thr Arg His Asn Tyr Arg Xaa Val Ala Pro
115 120 125

Leu Val Lys Ala Phe Cys Ala Lys His Gly Leu His Tyr Glu Val
130 135 140



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: other nucleic acid
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
CCAAGCTTCT GCAGGAGCTC TTTTTTTTTT TTTTT 35 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CUACUACUAC UAGGAGTCCT CTACGGTGTT TTG 33 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAUCAUCAUC AUATGATGCT CAAGCTGAAA CTG 33
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TACCAACTCG AGAAAATGGC TGCTGCTCCC AGTGTGAGG 39 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
AACTGATCTA GATTACTGCG CCTTACCCAT CTTGGAGGC
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
TACCAACTCG AGAAAATGGC ACCTCCCAAC ACTATCGAT 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AACTGATCTA GATTACTTCT TGAAAAAGAC CACGTCTCC 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 746 nucleic acids

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGTATGTCAC TCCATTCCAA ACTCGTTCAT GGTATCATAA ATATCAACAC ATTTACGCTC
CACTCCTCTA TGGTATTTAC ACACTCAAAT ATCGTACTCA AGATTGGGAA GCTTTTGTAA
AGGATGGTAA AAATGGTGCA ATTCGTGTTA GTGTCGCCAC AAATTTCGAT AAGGCCGCTT
ACGTCATTGG TAAATTGTCT TTTGTTTTCT TCCGTTTCAT CCTTCCACTC CGTTATCATA
GCTTTACAGA TTTAATTTGT TATTTCCTCA TTGCTGAATT CGTCTTTGGT TGGTATCTCA
CAATTAATTT CCAAGTTAGT CATGTCGCTG AAGATCTCAA ATTCTTTGCT ACCCCTGAAA
GACCAGATGA ACCATCTCAA ATCAATGAAG ATTGGGCAAT CCTTCAACTT AAAACTACTC
AAGATTATGG TCATGGTTCA CTCCTTTGTA CCTTTTTTAG TGGTTCTTTA AATCATCAAG
TTGTTCATCA TTTATTCCCA TCAATTGCTC AAGATTTCTA CCCACAACTT GTACCAATTG
TAAAAGAAGT TTGTAAAGAA CATAACATTA CTTACCACAT TAAACCAAAC TTCACTGAAG
CTATTATGTC ACACATTAAT TACCTTTACA AAATGGGTAA TGATCCAGAT TATGTTAAAA
AACCATTAGC CTCAAAAGAT GATTAAATGA AATAACTTAA AAACCAATTA TTTACTTTTG
ACAAACAGTA ATATTAATAA ATACAA



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 227 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



Tyr Val Thr Pro Phe Gln Thr Arg Ser Trp Tyr His Lys Tyr Gln His
1 5 10 15

Ile Tyr Ala Pro Leu Leu Tyr Gly Ile Tyr Thr Leu Lys Tyr Arg Thr
20 25 30

Gln Asp Trp Glu Ala Phe Val Lys Asp Gly Lys Asn Gly Ala Ile Arg
35 40 45

Val Ser Val Ala Thr Asn Phe Asp Lys Ala Ala Tyr Val Ile Gly Lys
50 55 60

Leu Ser Phe Val Phe Phe Arg Phe Ile Leu Pro Leu Arg Tyr His Ser
65 70 75 80

Phe Thr Asp Leu Ile Cys Tyr Phe Leu Ile Ala Glu Phe Val Phe Gly
85 90 95

Trp Tyr Leu Thr Ile Asn Phe Gln Val Ser His Val Ala Glu Asp Leu
100 105 110

Lys Phe Phe Ala Thr Pro Glu Arg Pro Asp Glu Pro Ser Gln Ile Asn
115 120 125

Glu Asp Trp Ala Ile Leu Gln Leu Lys Thr Thr Gln Asp Tyr Gly His
130 135 140

Gly Ser Leu Leu Cys Thr Phe Phe Ser Gly Ser Leu Asn His Gln Val
145 150 155 160

Val His His Leu Phe Pro Ser Ile Ala Gln Asp Phe Tyr Pro Gln Leu
165 170 175

Val Pro Ile Val Lys Glu Val Cys Lys Glu His Asn Ile Thr Tyr His
180 185 190

Ile Lys Pro Asn Phe Thr Glu Ala Ile Met Ser His Ile Asn Tyr Leu
195 200 205

Tyr Lys Met Gly Asn Asp Pro Asp Tyr Val Lys Lys Pro Leu Ala Ser
210 215 220

Lys Asp Asp ***
225



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

TTTTGGAAGG NTCCAAGTTN ACCACGGANT NGGCAAGTTN ACGGGGCGGA AANCGGTTTT
CCCCCCAAGC CTTTTGTCGA CTGGTTCTGT GGTGGCTTCC AGTACCAAGT CGACCACCAC
TTATTCCCCA GCCTGCCCCG ACACAATCTG GCCAAGACAC ACGCACTGGT CGAATCGTTC
TGCAAGGAGT GGGGTGTCCA GTACCACGAA GCCGACCTCG TGGACGGGAC CATGGAAGTC
TTGCACCATT TGGGCAGCGT GGCCGGCGAA TTCGTCGTGG ATTTTGTACG CGACGGACCC
GCCATGTAAT CGTCGTTCGT GACGATGCAA GGGTTCACGC ACATCTACAC ACACTCACTC
ACACAACTAG TGTAACTCGT ATAGAATTCG GTGTCGACCT GGACCTTGTT TGACTGGTTG
GGGATAGGGT AGGTAGGCGG ACGCGTGGGT CGNCCCCGGG AATTCTGTGA CCGGTACCTG
GCCCGCGTNA AAGT



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Phe Trp Lys Xxx Pro Ser Xxx Pro Arg Xxx Xxx Gln Val Xxx Gly
1 5 10 15

Ala Glu Xxx Gly Phe Pro Pro Lys Pro Phe Val Asp Trp Phe Cys
20 25 30

Gly Gly Phe Gln Tyr Gln Val Asp His His Leu Phe Pro Ser Leu
35 40 45

Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val Glu Ser Phe
50 55 60

Cys Lys Glu Trp Gly Val Gln Tyr His Glu Ala Asp Leu Val Asp
65 70 75

Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly Glu
65 70 75

Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met
80 85



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 nucleic acids 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



GGATGGAGTT CGTCTGGATC GCTGTGCGCT ACGCGACGTG GTTTAAGCGT CATGGGTGCG
CTTGGGTACA CGCCGGGGCA GTCGTTGGGC ATGTACTTGT GCGCCTTTGG TCTCGGCTGC
ATTTACATTT TTCTGCAGTT CGCCGTAAGT CACACCCATT TGCCCGTGAG CAACCCGGAG
GATCAGCTGC ATTGGCTCGA GTACGCGCGG ACCACACTGT GAACATCAGC ACCAAGTCGT
GGTTTGTCAC ATGGTGGATG TCGAACCTCA ACTTTCAGAT CGAGCACCAC CTTTTCCCCA
CGGCGCCCCA GTTCCGTTTC AAGGAGATCA GCCCGCGCGT CGAGGCCCTC TTCAAGCGCC
ACGGTCTCCC TTACTACGAC ATGCCCTACA CGAGCGCCGT CTCCACCACC TTTGCCAACC
TCTACTCCGT CGGCCATTCC GTCGGCGACG CCAAGCGCGA CTAGCCTCTT TTCCTAGACC
TTAATTCCCC ACCCCACCCC ATGTTCTGTC TTCCTCCCGC



(2) INFORMATION FOR SEQ ID NO:24: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 153 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Met Glu Phe Val Trp Ile Ala Val Arg Tyr Ala Thr Trp Phe Lys
1 5 10 15

Arg His Gly Cys Ala Trp Val His Ala Gly Ala Val Val Gly His
20 25 30

Val Leu Val Arg Leu Trp Ser Arg Leu His Leu His Phe Ser Ala
35 40 45

Val Arg Arg Lys Ser His Pro Phe Ala Arg Glu Gln Pro Gly Gly
50 55 60

Ser Ala Ala Leu Ala Arg Val Arg Ala Asp His Thr Val Asn Ile
65 70 75

Ser Thr Lys Ser Trp Phe Val Thr Trp Trp Met Ser Asn Leu Asn
80 85 90

Phe Gln Ile Glu His His Leu Phe Pro Thr Ala Pro Gln Phe Arg
95 100 105

Phe Lys Glu Ile Ser Pro Arg Val Glu Ala Leu Phe Lys Arg His
110 115 120

Gly Leu Pro Tyr Tyr Asp Met Pro Tyr Thr Ser Ala Val Ser Thr
125 130 135

Thr Phe Ala Asn Leu Tyr Ser Val Gly His Ser Val Gly Asp Ala
140 145 150

Lys Arg Asp



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



ACGCGTCCGC CCACGCGTCC GCCGCGAGCA ACTCATCAAG GAAGGCTACT TTGACCCCTC

GCTCCCGCAC ATGACGTACC GCGTGGTCGA GATTGTTGTT CTCTTCGTGC TTTCCTTTTG 

GCTGATGGGT CAGTCTTCAC CCCTCGCGCT CGCTCTCGGC ATTGTCGTCA GCGGCATCTC 

TCAGGGTCGC TGCGGCTGGG TAATGCATGA GATGGGCCAT GGGTCGTTCA CTGGTGTCAT 

TTGGCTTGAC GACCGGTTGT GCGAGTTCTT TTACGGCGTT GGTTGTGGCA TGAGCGGTCA 

TTACTGGAAA AACCAGCACA GCAAACACCA CGCAGCGCCA AACCGGCTCG AGCACGATGT 

AGATCTCAAC ACCTTGCCAT TGGTGGCCTT CAACGAGCGC GTCGTGCGCA AGGTCCGACC 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:



Arg Val Arg Pro Arg Val Arg Arg Glu Gln Leu Ile Lys Glu Gly
1 5 10 15

Tyr Phe Asp Pro Ser Leu Pro His Met Thr Tyr Arg Val Val Glu
20 25 30

Ile Val Val Leu Phe Val Leu Ser Phe Trp Leu Met Gly Gln Ser
35 40 45

Ser Pro Leu Ala Leu Ala Leu Gly Ile Val Val Ser Gly Ile Ser
50 55 60

Gln Gly Arg Cys Gly Trp Val Met His Glu Met Gly His Gly Ser
65 70 75

Phe Thr Gly Val Ile Trp Leu Asp Asp Arg Leu Cys Glu Phe Phe
65 70 75

Tyr Gly Val Gly Cys Gly Met Ser Gly His Tyr Trp Lys Asn Gln
80 85 90

His Ser Lys His His Ala Ala Pro Asn Arg Leu Glu His Asp Val
95 100 105

Asp Leu Asn Thr Leu Pro Leu Val Ala Phe Asn Glu Arg Val Val
110 115 120

Arg Lys Val Arg Pro
125




















(2) INFORMATION FOR SEQ ID NO: 27:















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1219 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2692004) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



GCACGCCGAC CGGCGCCGGG AGATCCTGGC AAAGTATCCA GAGATAAAGT CCTTGATGAA 60

ACCTGATCCC AATTTGATAT GGATTATAAT TATGATGGTT CTCACCCAGT TGGGTGCATT 120

TTACATAGTA AAAGACTTGG ACTGGAAATG GGTCATATTT GGGGCCTATG CGTTTGGCAG 180

TTGCATTAAC CACTCAATGA CTCTGGCTAT TCATGAGATT GCCCACAATG CTGCCTTTGG 240

CAACTGCAAA GCAATGTGGA ATCGCTGGTT TGGAATGTTT GCTAATCTTC CTATTGGGAT 300

TCCATATTCA ATTTCCTTTA AGAGGTATCA CATGGATCAT CATCGGTACC TTGGAGCTGA 360

TGGCGTCGAT GTAGATATTC CTACCGATTT TGAGGGCTGG TTCTTCTGTA CCGCTTTCAG 420

AAAGTTTATA TGGGTTATTC TTCAGCCTCT CTTTTATGCC TTTCGACCTC TGTTCATCAA 480

CCCCAAACCA ATTACGTATC TGGAAGTTAT CAATACCGTG GCACAGGTCA CTTTTGACAT 540

TTTAATTTAT TACTTTTTGG GAATTAAATC CTTAGTCTAC ATGTTGGCAG CATCTTTACT 600

TGGCCTGGGT TTGCACCCAA TTTCTGGACA TTTTATAGCT GAGCATTACA TGTTCTTAAA 660

GGGTCATGAA ACTTACTCAT ATTATGGGCC TCTGAATTTA CTTACCTTCA ATGTGGGTTA 720

TCATAATGAA CATCATGATT TCCCCAACAT TCCTGGAAAA AGTCTTCCAC TGGTGAGGAA 780

AATAGCAGCT GAATACTATG ACAACCTCCC TCACTACAAT TCCTGGATAA AAGTACTGTA 840

TGATTTTGTG ATGGATGATA CAATAAGTCC CTACTCAAGA ATGAAGAGGC ACCAAAAAGG 900

AGAGATGGTG CTGGAGTAAA TATCATTAGT GCCAAAGGGA TTCTTCTCCA AAACTTTAGA 960

TGATAAAATG GAATTTTTGC ATTATTAAAC TTGAGACCAG TGATGCTCAG AAGCTCCCCT 1020

GGCACAATTT CAGAGTAAGA GCTCGGTGAT ACCAAGAAGT GAATCTGGCT TTTAAACAGT 1080

CAGCCTGACT CTGTACTGCT CAGTTTCACT CACAGGAAAC TTGTGACTTG TGTATTATCG 1140

TCATTGAGGA TGTTTCACTC ATGTCTGTCA TTTTATAAGC ATATCATTTA AAAAGCTTCT 1200

AAAAAGCTAT TTCGCCAGG 1219

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



TTACCTTCTA CGTCCGCTTC TTCCTCACTT ATGTGCCACT ATTGGGGCTG AAAGCTTCCT 60

GGGCCTTTTC TTCATAGTCA GGTTCCTGGA AAGCAACTGG TTTGTGTGGG TGACACAGAT 120

GAACCATATT CCCATGCACA TTGATCATGA CCGGAACATG GACTGGGTTT CCACCCAGCT 180

CCAGGCCACA TGCAATGTCC ACAAGTCTGC CTTCAATGAC TGGTTCAGTG GACACCTCAA 240

CTTCCAGATT GAGCACCATC TTTTTCCCAC GATGCCTCGA CACAATTACC ACAAAGTGGC 300

TCCCCTGGTG CAGTCCTTGT GTGCCAAGCA TGGCATAGAG TACCAGTCCA AGCCCCTGCT 360

GTCAGCCTTC GCCGACATCA TCCACTCACT AAAGGAGTCA GGGCAGCTCT GGCTAGATGC 420

CTATCTTCAC CAATAACAAC AGCCACCCTG CCCAGTCTGG AAGAAGAGGA GGAAGACTCT 480

GGAGCCAAGG CAGAGGGGAG CTTGAGGGAC AATGCCACTA TAGTTTAATA CTCAGAGGGG 540

GTTGGGTTTG GGGACATAAA GCCTCTGACT CAAACTCCTC CCTTTTATCT TCTAGCCACA 600

GTTCTAAGAC CCAAAGTGGG GGGTGGACAC AGAAGTCCCT AGGAGGGAAG GAGCT 655



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3506132) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60 

TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120

CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180

AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 240

CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300

AAGA 304

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3854933) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60

GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120

CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180

GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 240

CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300

CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360

CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420

TTTGGGACGT CCTTTTTGCC CTTCCTCCTC TGTGCGGTGC TGCTCAGTGC AGTTCAGGCC 480

CAGGCTGGCT GGCTGCAGCA TGACTTTGGG CACCTGTCGG TCTTCAGCAC CTCAAAGTGG 540

AACCATCTGC TACATCATTT TGTGATTGGC CACCTGAAGG GGGCCCCCGC CAGTTGGTGG 600

AACCACATGC ACTTCCAGCA CCATGCCAAG CCCAACTGCT TCCGCAAAGA CCCAGACATC 660

AACATGCATC CCTTCTTCTT TGCCTTGGGG AAGATCCTCT CTGTGGAGCT TGGGAAACAG 720

AAGAAAAAAT ATATGCCGTA CAACCACCAG CACARATACT TCTTCCTAAT TGGGCCCCCA 780

GCCTTGCTGC CTCTCTACTT CCAGTGGTAT ATTTTCTATT TTGTTATCCA GCGAAAGAAG 840

TGGGTGGACT TGGCCTGGAT CAGCAAACAG GAATACGATG AAGCCGGGCT TCCATTGTCC 900

ACCGCAAATG CTTCTAAA 918

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1686 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2511785)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

GCCACTTAAA GGGTGCCTCT GCCAACTGGT GGAATCATCG CCACTTCCAG CACCACGCCA 60 

AGCCTAACAT CTTCCACAAG GATCCCGATG TGAACATGCT GCACGTGTTT GTTCTGGGCG 120 

AATGGCAGCC CATCGAGTAC GGCAAGAAGA AGCTGAAATA CCTGCCCTAC AATCACCAGC 180

ACGAATACTT CTTCCTGATT GGGCCGCCGC TGCTCATCCC CATGTATTTC CAGTACCAGA 240

TCATCATGAC CATGATCGTC CATAAGAACT GGGTGGACCT GGCCTGGGCC GTCAGCTACT 300

ACATCCGGTT CTTCATCACC TACATCCCTT TCTACGGCAT CCTGGGAGCC CTCCTTTTCC 360

TCAACTTCAT CAGGTTCCTG GAGAGCCACT GGTTTGTGTG GGTCACACAG ATGAATCACA 420

TCGTCATGGA GATTGACCAG GAGGCCTACC GTGACTGGTT CAGTAGCCAG CTGACAGCCA 480

CCTGCAACGT GGAGCAGTCC TTCTTCAACG ACTGGTTCAG TGGACACCTT AACTTCCAGA 540

TTGAGCACCA CCTCTTCCCC ACCATGCCCC GGCACAACTT ACACAAGATC GCCCCGCTGG 600

TGAAGTCTCT ATGTGCCAAG CATGGCATTG AATACCAGGA GAAGCCGCTA CTGAGGGCCC 660

TGCTGGACAT CATCAGGTCC CTGAAGAAGT CTGGGAAGGT GTGGCTGGAC GCCTACCTTC 720

ACAAATGAAG CCACAGCCCC CGGGACACCG TGGGGAAGGG GTGCAGGTGG GGTGATGGCC 780

AGAGGAATGA TGGGCTTTTG TTCTGAGGGG TGTCCGAGAG GCTGGTGTAT GCACTGCTCA 840

CGGACCCCAT GTTGGATCTT TCTCCCTTTC TCCTCTCCTT TTTCTCTTCA CATCTCCCCC 900

ATAGCACCCT GCCCTCATGG GACCTGCCCT CCCTCAGCCG TCAGCCATCA GCCATGGCCC 960

TCCCAGTGCC TCCTAGCCCC TTCTTCCAAG GAGCAGAGAG GTGGCCACCG GGGGTGGCTC 1020

TGTCCTACCT CCACTCTCTG CCCCTAAAGA TGGGAGGAGA CCAGCGGTCC ATGGGTCTGG 1080

CCTGTGAGTC TCCCCTTGCA GCCTGGTCAC TAGGCATCAC CCCCGCTTTG GTTCTTCAGA 1140

TGCTCTTGGG GTTCATAGGG GCAGGTCCTA GTCGGGCAGG GCCCCTGACC CTCCCGGCCT 1200

GGCTTCACTC TCCCTGACGG CTGCCATTGG TCCACCCTTT CATAGAGAGG CCTGCTTTGT 1260

TACAAAGCTC GGGTCTCCCT CCTGCAGCTC GGTTAAGTAC CCGAGGCCTC TCTTAAGATG 1320

TCCAGGGCCC CAGGCCCGCG GGCACAGCCA GCCCAAACCT TGGGCCCTGG AAGAGTCCTC 1380

CACCCCATCA CTAGAGTGCT CTGACCCTGG GCTTTCACGG GCCCCATTCC ACCGCCTCCC 1440

CAACTTGAGC CTGTGACCTT GGGACCAAAG GGGGAGTCCC TCGTCTCTTG TGACTCAGCA 1500

GAGGCAGTGG CCACGTTCAG GGAGGGGCCG GCTGGCCTGG AGGCTCAGCC CACCCTCCAG 1560

CTTTTCCTCA GGGTGTCCTG AGGTCCAAGA TTCTGGAGCA ATCTGACCCT TCTCCAAAGG 1620

CTCTGTTATC AGCTGGGCAG TGCCAGCCAA TCCCTGGCCA TTTGGCCCCA GGGGACGTGG 1680

GCCCTG 1686



(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1843 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60

TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120

CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180

AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 240

CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300

AAGAAGAAGC TGAAATACCT GCCCTACAAT CACCAGCACG AATACTTCTT CCTGATTGGG 360

CCGCCGCTGC TCATCCCCAT GTATTTCCAG TACCAGATCA TCATGACCAT GATCGTCCAT 420

AAGAACTGGG TGGACCTGGC CTGGGCCGTC AGCTACTACA TCCGGTTCTT CATCACCTAC 480

ATCCCTTTCT ACGGCATCCT GGGAGCCCTC CTTTTCCTCA ACTTCATCAG GTTCCTGGAG 540

AGCCACTGGT TTGTGTGGGT CACACAGATG AATCACATCG TCATGGAGAT TGACCAGGAG 600

GCCTACCGTG ACTGGTTCAG TAGCCAGCTG ACAGCCACCT GCAACGTGGA GCAGTCCTTC 660

TTCAACGACT GGTTCAGTGG ACACCTTAAC TTCCAGATTG AGCACCACCT CTTCCCCACC 720

ATGCCCCGGC ACAACTTACA CAAGATCGCC CCGCTGGTGA AGTCTCTATG TGCCAAGCAT 780

GGCATTGAAT ACCAGGAGAA GCCGCTACTG AGGGCCCTGC TGGACATCAT CAGGTCCCTG 840

AAGAAGTCTG GGAAGCTGTG GCTGGACGCC TACCTTCACA AATGAAGCCA CAGCCCCCGG 900

GACACCGTGG GGAAGGGGTG CAGGTGGGGT GATGGCCAGA GGAATGATGG GCTTTTGTTC 960

TGAGGGGTGT CCGAGAGGCT GGTGTATGCA CTGCTCACGG ACCCCATGTT GGATCTTTCT 1020

CCCTTTCTCC TCTCCTTTTT CTCTTCACAT CTCCCCCATA GCACCCTGCC CTCATGGGAC 1080

CTGCCCTCCC TCAGCCGTCA GCCATCAGCC ATGGCCCTCC CAGTGCCTCC TAGCCCCTTC 1140

TTCCAAGGAG CAGAGAGGTG GCCACCGGGG GTGGCTCTGT CCTACCTCCA CTCTCTGCCC 1200

CTAAAGATGG GAGGAGACCA GCGGTCCATG GGTCTGGCCT GTGAGTCTCC CCTTGCAGCC 1260

TGGTCACTAG GCATCACCCC CGCTTTGGTT CTTCAGATGC TCTTGGGGTT CATAGGGGCA 1320

GGTCCTAGTC GGGCAGGGCC CCTGACCCTC CCGGCCTGGC TTCACTCTCC CTGACGGCTG 1380

CCATTGGTCC ACCCTTTCAT AGAGAGGCCT GCTTTGTTAC AAAGCTCGGG TCTCCCTCCT 1440

GCAGCTCGGT TAAGTACCCG AGGCCTCTCT TAAGATGTCC AGGGCCCCAG GCCCGCGGGC 1500

ACAGCCAGCC CAAACCTTGG GCCCTGGAAG AGTCCTCCAC CCCATCACTA GAGTGCTCTG 1560

ACCCTGGGCT TTCACGGGCC CCATTCCACC GCCTCCCCAA CTTGAGCCTG TGACCTTGGG 1620

ACCAAAGGGG GAGTCCCTCG TCTCTTGTGA CTCAGCAGAG GCAGTGGCCA CGTTCAGGGA 1680

GGGGCCGGCT GGCCTGGAGG CTCAGCCCAC CCTCCAGCTT TTCCTCAGGG TGTCCTGAGG 1740

TCCAAGATTC TGGAGCAATC TGACCCTTCT CCAAAGGCTC TGTTATCAGC TGGGCAGTGC 1800

CAGCCAATCC CTGGCCATTT GGCCCCAGGG GACGTGGGCC CTG 1843

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2257 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 253538a)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:



CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60
GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120
CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180
GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 240
CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300
CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360
CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420
TTTGGGACGT CCTTTTTGCC CTTCCTCCTC TGTGCGGTGC TGCTCAGTGC AGTTCAGCAG 480
GCCCAAGCTG GATGGCTGCA ACATGATTAT GGCCACCTGT


V— J. O J. l_ i. r\\^r\\s 








TGGAACCACC TTGTCCACAA ATTCGTCATT GGCCACTTAA AGGGTGCCTC TGCCAACTGG 600
TGGAATCATC GCCACTTCCA GCACCACGCC AAGCCTAACA TCTTCCACAA GGATCCCGAT 660
GTGAACATGC TGCACGTGTT TGTTCTGGGC GAATGGCAGC CCATCGAGTA CGGCAAGAAG 720
AAGCTGAAAT ACCTGCCCTA CAATCACCAG CACGAATACT TCTTCCTGAT TGGGCCGCCG 780
CTGCTCATCC CCATGTATTT CCAGTACCAG ATCATCATGA CCATGATCGT CCATAAGAAC 840
TGGGTGGACC TGGCCTGGGC CGTCAGCTAC TACATCCGGT TCTTCATCAC CTACATCCCT 900
TTCTACGGCA TCCTGGGAGC CCTCCTTTTC CTCAACTTCA TCAGGTTCCT GGAGAGCCAC 960
TGGTTTGTGT GGGTCACACA GATGAATCAC ATCGTCATGG AGATTGACCA GGAGGCCTAC 1020
CGTGACTGGT TCAGTAGCCA GCTGACAGCC ACCTGCAACG TGGAGCAGTC CTTCTTCAAC 1080
GACTGGTTCA GTGGACACCT TAACTTCCAG ATTGAGCACC ACCTCTTCCC CACCATGCCC 1140
CGGCACAACT TACACAAGAT CGCCCCGCTG GTGAAGTCTC TATGTGCCAA GCATGGCATT 1200
GAATACCAGG AGAAGCCGCT ACTGAGGGCC CTGCTGGACA TCATCAGGTC CCTGAAGAAG 1260
TCTGGGAAGC TGTGGCTGGA CGCCTACCTT CACAAATGAA GCCACAGCCC CCGGGACACC 1320
GTGGGGAAGG GGTGCAGGTG GGGTGATGGC CAGAGGAATG ATGGGCTTTT GTTCTGAGGG 1380
GTGTCCGAGA GGCTGGTGTA TGCACTGCTC ACGGACCCCA TGTTGGATCT TTCTCCCTTT 1440
CTCCTCTCCT TTTTCTCTTC ACATCTCCCC CATAGCACCC TGCCCTCATG GGACCTGCCC 1500
TCCCTCAGCC GTCAGCCATC AGCCATGGCC CTCCCAGTGC CTCCTAGCCC CTTCTTCCAA 1560
GGAGCAGAGA GGTGGCCACC GGGGGTGGCT CTGTCCTACC TCCACTCTCT GCCCCTAAAG 1620
ATGGGAGGAG ACCAGCGGTC CATGGGTCTG GCCTGTGAGT CTCCCCTTGC AGCCTGGTCA 1680
CTAGGCATCA CCCCCGCTTT GGTTCTTCAG ATGCTCTTGG GGTTCATAGG GGCAGGTCCT 1740
AGTCGGGCAG GGCCCCTGAC CCTCCCGGCC TGGCTTCACT CTCCCTGACG GCTGCCATTG 1800
GTCCACCCTT TCATAGAGAG GCCTGCTTTG TTACAAAGCT CGGGTCTCCC TCCTGCAGCT 1860
CGGTTAAGTA CCCGAGGCCT CTCTTAAGAT GTCCAGGGCC CCAGGCCCGC GGGCACAGCC 1920
AGCCCAAACC TTGGGCCCTG GAAGAGTCCT CCACCCCATC ACTAGAGTGC TCTGACCCTG 1980
GGCTTTCACG GGCCCCATTC CACCGCCTCC CCAACTTGAG CCTGTGACCT TGGGACCAAA 2040
GGGGGAGTCC CTCGTCTCTT GTGACTCAGC AGAGGCAGTG GCCACGTTCA GGGAGGGGCC 2100
GGCTGGCCTG GAGGCTCAGC CCACCCTCCA GCTTTTCCTC AGGGTGTCCT GAGGTCCAAG 2160
ATTCTGGAGC AATCTGACCC TTCTCCAAAG GCTCTGTTAT CAGCTGGGCA GTGCCAGCCA 2220
ATCCCTGGCC ATTTGGCCCC AGGGGACGTG GGCCCTG 2257






(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2692004)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:



His Ala Asp Arg Arg Arg Glu Ile Leu Ala Lys Tyr Pro Glu Ile
1 5 10 15

Lys Ser Leu Met Lys Pro Asp Pro Asn Leu Ile Trp Ile Ile Ile
20 25 30

Met Met Val Leu Thr Gln Leu Gly Ala Phe Tyr Ile Val Lys Asp
35 40 45

Leu Asp Trp Lys Trp Val Ile Phe Gly Ala Tyr Ala Phe Gly Ser
50 55 60

Cys Ile Asn His Ser Met Thr Leu Ala Ile His Glu Ile Ala His
65 70 75

Asn Ala Ala Phe Gly Asn Cys Lys Ala Met Trp Asn Arg Trp Phe
80 85 90

Gly Met Phe Ala Asn Leu Pro Ile Gly Ile Pro Tyr Ser Ile Ser
95 100 105

Phe Lys Arg Tyr His Met Asp His His Arg Tyr Leu Gly Ala Asp
110 115 120

Gly Val Asp Val Asp Ile Pro Thr Asp Phe Glu Gly Trp Phe Phe
125 130 135

Cys Thr Ala Phe Arg Lys Phe Ile Trp Val Ile Leu Gln Pro Leu
140 145 150

Phe Tyr Ala Phe Arg Pro Leu Phe Ile Asn Pro Lys Pro Ile Thr
155 160 165

Tyr Leu Glu Val Ile Asn Thr Val Ala Gln Val Thr Phe Asp Ile
170 175 180

Leu Ile Tyr Tyr Phe Leu Gly Ile Lys Ser Leu Val Tyr Met Leu
185 190 195

Ala Ala Ser Leu Leu Gly Leu Gly Leu His Pro Ile Ser Gly His
200 205 210

Phe Ile Ala Glu His Tyr Met Phe Leu Lys Gly His Glu Thr Tyr
215 220 225

Ser Tyr Tyr Gly Pro Leu Asn Leu Leu Thr Phe Asn Val Gly Tyr
230 235 240

His Asn Glu His His Asp Phe Pro Asn Ile Pro Gly Lys Ser Leu
245 250 255

Pro Leu Val Arg Lys Ile Ala Ala Glu Tyr Tyr Asp Asn Leu Pro
260 265 270

His Tyr Asn Ser Trp Ile Lys Val Leu Tyr Asp Phe Val Met Asp
275 280 285

Asp Thr Ile Ser Pro Tyr Ser Arg Met Lys Arg His Gln Lys Gly
290 295 300

Glu Met Val Leu Glu *** Ile Ser Leu Val Pro Lys Gly Phe Phe
305 310 315

Ser Lys Thr Leu Asp Asp Lys Met Glu Phe Leu His Tyr *** Thr
320 325 330

*** Asp Gln *** Cys Ser Glu Ala Pro Leu Ala Gln Phe Gln Ser
335 340 345

Lys Ser Ser Val Ile Pro Arg Ser Glu Ser Gly Phe *** Thr Val
350 355 360

Ser Leu Thr Leu Tyr Cys Ser Val Ser Leu Thr Gly Asn Leu ***
365 370 375

Leu Val Tyr Tyr Arg His *** Gly Cys Phe Thr His Val Cys His
380 385 390

Phe Ile Ser Ile Ser Phe Lys Lys Leu Leu Lys Ser Tyr Phe Ala
400 405 410

Arg






(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: amino acid (Translation of Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



Tyr Leu Leu Arg Pro Leu Leu Pro His Leu Cys Ala Thr Ile Gly
1 5 10 15

Ala Glu Ser Phe Leu Gly Leu Phe Phe Ile Val Arg Phe Leu Glu
20 25 30

Ser Asn Trp Phe Val Trp Val Thr Gln Met Asn His Ile Pro Met
35 40 45

His Ile Asp His Asp Arg Asn Met Asp Trp Val Ser Thr Gln Leu
50 55 60

Gln Ala Thr Cys Asn Val His Lys Ser Ala Phe Asn Asp Trp Phe
65 70 75

Ser Gly His Leu Asn Phe Gln Ile Glu His His Leu Phe Pro Thr
80 85 90

Met Pro Arg His Asn Tyr His Lys Val Ala Pro Leu Val Gln Ser
95 100 105

Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Ser Lys Pro Leu Leu
110 115 120

Ser Ala Phe Ala Asp Ile Ile His Ser Leu Lys Glu Ser Gly Gln
125 130 135

Leu Trp Leu Asp Ala Tyr Leu His Gln *** Gln Gln Pro Pro Cys
140 145 150

Pro Val Trp Lys Lys Arg Arg Lys Thr Leu Glu Pro Arg Gln Arg
155 160 165

Gly Ala *** Gly Thr Met Pro Leu *** Phe Asn Thr Gln Arg Gly
170 175 180

Leu Gly Leu Gly Thr *** Ser Leu *** Leu Lys Leu Leu Pro Phe
185 190 195

Ile Phe *** Pro Gln Phe *** Asp Pro Lys Trp Gly Val Asp Thr
200 205 210

Glu Val Pro Arg Arg Glu Gly Ala
215



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 3506132) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



Val Phe Tyr Phe Gly Asn Gly Trp Ile Pro Thr Leu Ile Thr Ala
1 5 10 15

Phe Val Leu Ala Thr Ser Gln Ala Gln Ala Gly Trp Leu Gln His
20 25 30

Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys Trp Asn His
35 40 45

Leu Val His Lys Phe Val Ile Gly His Leu Lys Gly Ala Ser Ala
50 55 60

Asn Trp Trp Asn His Arg His Phe Gln His His Ala Lys Pro Asn
65 70 75

Leu Gly Glu Trp Gln Pro Ile Glu Tyr Gly Lys Xxx
80 85









(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: amino acid (Translation of Contig 3854933) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



Gln Gly Pro Thr Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gln
1 5 10 15

Arg Ser Gly Cys Glu Glu Arg Trp Leu Val Ile Asp Arg Lys Val
20 25 30

Tyr Asn Ile Ser Glu Phe Thr Arg Arg His Pro Gly Gly Ser Arg
35 40 45

Val Ile Ser His Tyr Ala Gly Gln Asp Ala Thr Asp Pro Phe Val
50 55 60

Ala Phe His Ile Asn Lys Gly Leu Val Lys Lys Tyr Met Asn Ser
65 70 75

Leu Leu Ile Gly Glu Leu Ser Pro Glu Gln Pro Ser Phe Glu Pro
80 85 90


Thr 


Lys 


Asn 


Lys 


Glu 
95 


Leu 


Thr 


Asp 


Glu 


Phe 
100 




Glu 




Arg 


rtX d 

105 


Thr 


Val 


Glu Arg 


Met 


Glv 


Leu 


Met 


Lys 


Al a 
nl a 




Hie: 


val 


rne 


Ph o 










110 










115 










120 


Leu 


Leu 


i y x 


Leu 


Leu 
125 


His 


He 


Leu 


Leu 


Leu 
130 


Asp 


r:i \/ 
oiy 


t\J.a 


TV 1 


Trp 
135 


Leu 


Thr 




Trp 


Val 
140 


Phe 


Gly 


1 11 X 


OCX 


xrne 
145 


Leu 


P v r-i 

rro 




Leu 


Leu 
150 


Cys 


Ala 


Val 


Leu 


Leu 


Ser 


Ala 


Val 


Gin 


Ala 


Gin 


Ala 


uiy 


Trp 


Leu 










155 










160 








165 


Gin 


His 


As p 


Phe 


Glv 
170 


His 


Leu 


Ser 


Val 


Phe 
175 




Th t- 
1 III. 




Lys 


Trp 
180 


Asn 


His 






His 
185 


His 


Phe 


Val 


He 


Gly 
190 


His 




Lys 


v>iy 


rVX Ci 

195 


Pro 


Ala 


oci 


Trp 


TrD 
200 


Asn 


His 


Met 


His 


Phe 
205 


Gin 


His 


His 


Ala 


210 


Pro 


Asn 


Cys 


Phe 


Ar g 
215 


Lys 


• Asp 


Pro 




He 
220 


Asn 


l it; U 


nio 


Pro 


rile 

225 


Phe 


Phe 


Ala 


Leu 


Gly 
230 




lie 




Ser 


Val 

235 


oJ. U 


Leu 




Lys 


bin 
240 


Lys Lys Lys Tyr Met Pro Tyr Asn His Gln His Xxx Tyr Phe Phe
245 250 255

Leu Ile Gly Pro Pro Ala Leu Leu Pro Leu Tyr Phe Gln Trp Tyr
260 265 270

Ile Phe Tyr Phe Val Ile Gln Arg Lys Lys Trp Val Asp Leu Ala
275 280 285

Trp Ile Ser Lys Gln Glu Tyr Asp Glu Ala Gly Leu Pro Leu Ser
290 295 300

Thr Ala Asn Ala Ser Lys
305





















(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2511785) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:



His Leu Lys Gly Ala Ser Ala Asn Trp Trp Asn His Arg His Phe
1 5 10 15

Gln His His Ala Lys Pro Asn Ile Phe His Lys Asp Pro Asp Val
20 25 30

Asn Met Leu His Val Phe Val Leu Gly Glu Trp Gln Pro Ile Glu
35 40 45

Tyr Gly Lys Lys Lys Leu Lys Tyr Leu Pro Tyr Asn His Gln His
50 55 60

Glu Tyr Phe Phe Leu Ile Gly Pro Pro Leu Leu Ile Pro Met Tyr
65 70 75

Phe Gln Tyr Gln Ile Ile Met Thr Met Ile Val His Lys Asn Trp
80 85 90

Val Asp Leu Ala Trp Ala Val Ser Tyr Tyr Ile Arg Phe Phe Ile
95 100 105

Thr Tyr Ile Pro Phe Tyr Gly Ile Leu Gly Ala Leu Leu Phe Leu
110 115 120

Asn Phe Ile Arg Phe Leu Glu Ser His Trp Phe Val Trp Val Thr
125 130 135

Gln Met Asn His Ile Val Met Glu Ile Asp Gln Glu Ala Tyr Arg
140 145 150

Asp Trp Phe Ser Ser Gln Leu Thr Ala Thr Cys Asn Val Glu Gln
155 160 165

Ser Phe Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile
170 175 180

Glu His His Leu Phe Pro Thr Met Pro Arg His Asn Leu His Lys
185 190 195

Ile Ala Pro Leu Val Lys Ser Leu Cys Ala Lys His Gly Ile Glu
200 205 210

Tyr Gln Glu Lys Pro Leu Leu Arg Ala Leu Leu Asp Ile Ile Arg
215 220 225

Ser Leu Lys Lys Ser Gly Lys Leu Trp Leu Asp Ala Tyr Leu His
230 235 240

Lys *** Ser His Ser Pro Arg Asp Thr Val Gly Lys Gly Cys Arg
245 250 255

Trp Gly Asp Gly Gln Arg Asn Asp Gly Leu Leu Phe *** Gly Val
260 265 270

Ser Glu Arg Leu Val Tyr Ala Leu Leu Thr Asp Pro Met Leu Asp
275 280 285

Leu Ser Pro Phe Leu Leu Ser Phe Phe Ser Ser His Leu Pro His
290 295 300

Ser Thr Leu Pro Ser Trp Asp Leu Pro Ser Leu Ser Arg Gln Pro
305 310 315

Ser Ala Met Ala Leu Pro Val Pro Pro Ser Pro Phe Phe Gln Gly
320 325 330

Ala Glu Arg Trp Pro Pro Gly Val Ala Leu Ser Tyr Leu His Ser
335 340 345

Leu Pro Leu Lys Met Gly Gly Asp Gln Arg Ser Met Gly Leu Ala
350 355 360

Cys Glu Ser Pro Leu Ala Ala Trp Ser Leu Gly Ile Thr Pro Ala
365 370 375

Leu Val Leu Gln Met Leu Leu Gly Phe Ile Gly Ala Gly Pro Ser
380 385 390

Arg Ala Gly Pro Leu Thr Leu Pro Ala Trp Leu His Ser Pro ***
400 405 410

Arg Leu Pro Leu Val His Pro Phe Ile Glu Arg Pro Ala Leu Leu
415 420 425

Gln Ser Ser Gly Leu Pro Pro Ala Ala Arg Leu Ser Thr Arg Gly
430 435 440

Leu Ser *** Asp Val Gln Gly Pro Arg Pro Ala Gly Thr Ala Ser
445 450 455

Pro Asn Leu Gly Pro Trp Lys Ser Pro Pro Pro His His *** Ser
460 465 470

Ala Leu Thr Leu Gly Phe His Gly Pro His Ser Thr Ala Ser Pro
475 480 485

Thr *** Ala Cys Asp Leu Gly Thr Lys Gly Gly Val Pro Arg Leu
490 495 500

Leu *** Leu Ser Arg Gly Ser Gly His Val Gln Gly Gly Ala Gly
505 510 515

Trp Pro Gly Gly Ser Ala His Pro Pro Ala Phe Pro Gln Gly Val
520 525 530

Leu Arg Ser Lys Ile Leu Glu Gln Ser Asp Pro Ser Pro Lys Ala
535 540 545

Leu Leu Ser Ala Gly Gln Cys Gln Pro Ile Pro Gly His Leu Ala
550 555 560

Pro Gly Asp Val Gly Pro Xxx
565



















(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2535)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:





Val Phe Tyr Phe Gly Asn Gly Trp Ile Pro Thr Leu Ile Thr Ala
1 5 10 15

Phe Val Leu Ala Thr Ser Gln Ala Gln Ala Gly Trp Leu Gln His
20 25 30

Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys Trp Asn His
35 40 45

Leu Val His Lys Phe Val Ile Gly His Leu Lys Gly Ala Ser Ala
50 55 60

Asn Trp Trp Asn His Arg His Phe Gln His His Ala Lys Pro Asn
65 70 75

Ile Phe His Lys Asp Pro Asp Val Asn Met Leu His Val Phe Val
80 85 90

Leu Gly Glu Trp Gln Pro Ile Glu Tyr Gly Lys Lys Lys Leu Lys
95 100 105

Tyr Leu Pro Tyr Asn His Gln His Glu Tyr Phe Phe Leu Ile Gly
110 115 120

Pro Pro Leu Leu Ile Pro Met Tyr Phe Gln Tyr Gln Ile Ile Met
125 130 135

Thr Met Ile Val His Lys Asn Trp Val Asp Leu Ala Trp Ala Val
140 145 150

Ser Tyr Tyr Ile Arg Phe Phe Ile Thr Tyr Ile Pro Phe Tyr Gly
155 160 165

Ile Leu Gly Ala Leu Leu Phe Leu Asn Phe Ile Arg Phe Leu Glu
170 175 180

Ser His Trp Phe Val Trp Val Thr Gln Met Asn His Ile Val Met
185 190 195

Glu Ile Asp Gln Glu Ala Tyr Arg Asp Trp Phe Ser Ser Gln Leu
200 205 210

Thr Ala Thr Cys Asn Val Glu Gln Ser Phe Phe Asn Asp Trp Phe
215 220 225

Ser Gly His Leu Asn Phe Gln Ile Glu His His Leu Phe Pro Thr
230 235 240

Met Pro Arg His Asn Leu His Lys Ile Ala Pro Leu Val Lys Ser
245 250 255

Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Glu Lys Pro Leu Leu
260 265 270

Arg Ala Leu Leu Asp Ile Ile Arg Ser Leu Lys Lys Ser Gly Lys
275 280 285

Leu Trp Leu Asp Ala Tyr Leu His Lys *** Ser His Ser Pro Arg
290 295 300

Asp Thr Val Gly Lys Gly Cys Arg Trp Gly Asp Gly Gln Arg Asn
305 310 315

Asp Gly Leu Leu Phe *** Gly Val Ser Glu Arg Leu Val Tyr Ala
320 325 330

Leu Leu Thr Asp Pro Met Leu Asp Leu Ser Pro Phe Leu Leu Ser
335 340 345

Phe Phe Ser Ser His Leu Pro His Ser Thr Leu Pro Ser Trp Asp
350 355 360

Leu Pro Ser Leu Ser Arg Gln Pro Ser Ala Met Ala Leu Pro Val
365 370 375

Pro Pro Ser Pro Phe Phe Gln Gly Ala Glu Arg Trp Pro Pro Gly
380 385 390

Val Ala Leu Ser Tyr Leu His Ser Leu Pro Leu Lys Met Gly Gly
400 405 410

Asp Gln Arg Ser Met Gly Leu Ala Cys Glu Ser Pro Leu Ala Ala
415 420 425

Trp Ser Leu Gly Ile Thr Pro Ala Leu Val Leu Gln Met Leu Leu
430 435 440

Gly Phe Ile Gly Ala Gly Pro Ser Arg Ala Gly Pro Leu Thr Leu
445 450 455

Pro Ala Trp Leu His Ser Pro *** Arg Leu Pro Leu Val His Pro
460 465 470

Phe Ile Glu Arg Pro Ala Leu Leu Gln Ser Ser Gly Leu Pro Pro
475 480 485

Ala Ala Arg Leu Ser Thr Arg Gly Leu Ser *** Asp Val Gln Gly
490 495 500

Pro Arg Pro Ala Gly Thr Ala Ser Pro Asn Leu Gly Pro Trp Lys
505 510 515

Ser Pro Pro Pro His His *** Ser Ala Leu Thr Leu Gly Phe His
520 525 530

Gly Pro His Ser Thr Ala Ser Pro Thr *** Ala Cys Asp Leu Gly
535 540 545

Thr Lys Gly Gly Val Pro Arg Leu Leu *** Leu Ser Arg Gly Ser
550 555 560

Gly His Val Gln Gly Gly Ala Gly Trp Pro Gly Gly Ser Ala His
565 570 575

Pro Pro Ala Phe Pro Gln Gly Val Leu Arg Ser Lys Ile Leu Glu
580 585 590

Gln Ser Asp Pro Ser Pro Lys Ala Leu Leu Ser Ala Gly Gln Cys
595 600 605

Gln Pro Ile Pro Gly His Leu Ala Pro Gly Asp Val Gly Pro Xxx
610 615 620






(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: amino acid (Translation of Contig 253538a) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 



Gln Gly Pro Thr Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gln
1 5 10 15

Arg Ser Gly Cys Glu Glu Arg Trp Leu Val Ile Asp Arg Lys Val
20 25 30

Tyr Asn Ile Ser Glu Phe Thr Arg Arg His Pro Gly Gly Ser Arg
35 40 45

Val Ile Ser His Tyr Ala Gly Gln Asp Ala Thr Asp Pro Phe Val
50 55 60

Ala Phe His Ile Asn Lys Gly Leu Val Lys Lys Tyr Met Asn Ser
65 70 75

Leu Leu Ile Gly Glu Leu Ser Pro Glu Gln Pro Ser Phe Glu Pro
80 85 90

Thr Lys Asn Lys Glu Leu Thr Asp Glu Phe Arg Glu Leu Arg Ala
95 100 105

Thr Val Glu Arg Met Gly Leu Met Lys Ala Asn His Val Phe Phe
110 115 120

Leu Leu Tyr Leu Leu His Ile Leu Leu Leu Asp Gly Ala Ala Trp
125 130 135

Leu Thr Leu Trp Val Phe Gly Thr Ser Phe Leu Pro Phe Leu Leu
140 145 150

Cys Ala Val Leu Leu Ser Ala Val Gln Gln Ala Gln Ala Gly Trp
155 160 165

Leu Gln His Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys
170 175 180

Trp Asn His Leu Val His Lys Phe Val Ile Gly His Leu Lys Gly
185 190 195

Ala Ser Ala Asn Trp Trp Asn His Arg His Phe Gln His His Ala
200 205 210

Lys Pro Asn Ile Phe His Lys Asp Pro Asp Val Asn Met Leu His
215 220 225

Val Phe Val Leu Gly Glu Trp Gln Pro Ile Glu Tyr Gly Lys Lys
230 235 240

Lys Leu Lys Tyr Leu Pro Tyr Asn His Gln His Glu Tyr Phe Phe
245 250 255

Leu Ile Gly Pro Pro Leu Leu Ile Pro Met Tyr Phe Gln Tyr Gln
260 265 270

Ile Ile Met Thr Met Ile Val His Lys Asn Trp Val Asp Leu Ala
275 280 285

Trp Ala Val Ser Tyr Tyr Ile Arg Phe Phe Ile Thr Tyr Ile Pro
290 295 300

Phe Tyr Gly Ile Leu Gly Ala Leu Leu Phe Leu Asn Phe Ile Arg
305 310 315

Phe Leu Glu Ser His Trp Phe Val Trp Val Thr Gln Met Asn His
320 325 330

Ile Val Met Glu Ile Asp Gln Glu Ala Tyr Arg Asp Trp Phe Ser
335 340 345

Ser Gln Leu Thr Ala Thr Cys Asn Val Glu Gln Ser Phe Phe Asn
350 355 360

Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu His His Leu
365 370 375

Phe Pro Thr Met Pro Arg His Asn Leu His Lys Ile Ala Pro Leu
380 385 390

Val Lys Ser Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Glu Lys
400 405 410

Pro Leu Leu Arg Ala Leu Leu Asp Ile Ile Arg Ser Leu Lys Lys
415 420 425

Ser Gly Lys Leu Trp Leu Asp Ala Tyr Leu His Lys *** Ser His
430 435 440

Ser Pro Arg Asp Thr Val Gly Lys Gly Cys Arg Trp Gly Asp Gly
445 450 455

Gln Arg Asn Asp Gly Leu Leu Phe *** Gly Val Ser Glu Arg Leu
460 465 470

Val Tyr Ala Leu Leu Thr Asp Pro Met Leu Asp Leu Ser Pro Phe
475 480 485

Leu Leu Ser Phe Phe Ser Ser His Leu Pro His Ser Thr Leu Pro
490 495 500

Ser Trp Asp Leu Pro Ser Leu Ser Arg Gln Pro Ser Ala Met Ala
505 510 515

Leu Pro Val Pro Pro Ser Pro Phe Phe Gln Gly Ala Glu Arg Trp
520 525 530

Pro Pro Gly Val Ala Leu Ser Tyr Leu His Ser Leu Pro Leu Lys
535 540 545

Met Gly Gly Asp Gln Arg Ser Met Gly Leu Ala Cys Glu Ser Pro
550 555 560

Leu Ala Ala Trp Ser Leu Gly Ile Thr Pro Ala Leu Val Leu Gln
565 570 575

Met Leu Leu Gly Phe Ile Gly Ala Gly Pro Ser Arg Ala Gly Pro
580 585 590

Leu Thr Leu Pro Ala Trp Leu His Ser Pro *** Arg Leu Pro Leu
595 600 605

Val His Pro Phe Ile Glu Arg Pro Ala Leu Leu Gln Ser Ser Gly
610 615 620

Leu Pro Pro Ala Ala Arg Leu Ser Thr Arg Gly Leu Ser *** Asp
625 630 635

Val Gln Gly Pro Arg Pro Ala Gly Thr Ala Ser Pro Asn Leu Gly
640 645 650

Pro Trp Lys Ser Pro Pro Pro His His *** Ser Ala Leu Thr Leu
655 660 665

Gly Phe His Gly Pro His Ser Thr Ala Ser Pro Thr *** Ala Cys
670 675 680

Asp Leu Gly Thr Lys Gly Gly Val Pro Arg Leu Leu *** Leu Ser
685 690 695

Arg Gly Ser Gly His Val Gln Gly Gly Ala Gly Trp Pro Gly Gly
700 705 710

Ser Ala His Pro Pro Ala Phe Pro Gln Gly Val Leu Arg Ser Lys
715 720 725

Ile Leu Glu Gln Ser Asp Pro Ser Pro Lys Ala Leu Leu Ser Ala
730 735 740

Gly Gln Cys Gln Pro Ile Pro Gly His Leu Ala Pro Gly Asp Val
745 750 755

Gly Pro Xxx
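
The listing describes SEQ ID NO:40 as a translation of Contig 253538a, whose edited nucleotide sequence is given as SEQ ID NO:33. The following is a minimal illustrative sketch of such a translation, not the procedure used to prepare the listing. It assumes the standard genetic code, reading frame 1, "***" for a stop codon and "Xxx" for an incomplete or unrecognized codon; the function name `translate` and the table names are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G base order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumption: the standard code applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue codes; "***" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}


def translate(dna):
    """Translate a DNA string in frame 1 into three-letter residue codes.

    Whitespace (such as the blocks of ten used in the listing) is ignored.
    A trailing partial codon, or any codon not in the table, yields "Xxx".
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3])  # None for partial or unknown codons
        residues.append(THREE_LETTER[aa] if aa else "Xxx")
    return residues


# First 60 bases of SEQ ID NO:33 as given in the listing.
first_row = "CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG"
print(" ".join(translate(first_row)))
```

Under these assumptions the first 60 bases of SEQ ID NO:33 give Gln Gly Pro Thr Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gln Arg Ser Gly Cys Glu, which agrees with residues 1 to 20 of SEQ ID NO:40.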
What is claimed is:

1. An isolated nucleic acid comprising: 

a nucleotide sequence depicted in SEQ ID NO: 1 or SEQ ID NO: 3. 

5 

2. A polypeptide encoded by a nucleotide sequence according to claim 1 . 

3. A purified or isolated polypeptide comprising an amino acid sequence 
depicted in SEQ ID NO: 2 or SEQ ID NO: 4. 

10 

4. An isolated nucleic acid encoding a polypeptide having an amino acid 
sequence depicted in SEQ ID NO: 2 or SEQ ID NO: 4. 

5. An isolated nucleic acid comprising a nucleotide sequence which encodes a 
15 polypeptide which desaturates a fatty acid molecule at carbon 6 or 12 from the 

carboxyl end of said polypeptide, wherein said nucleotide sequence has an average 
A/T content of less than about 60%. 

6. The isolated nucleic acid according to Claim 5, wherein said nucleic acid is
derived from a fungus.

7. The isolated nucleic acid according to Claim 6, wherein said fungus is of the 
genus Mortierella. 

8. The isolated nucleic acid according to Claim 7, wherein said fungus is of the
species Mortierella alpina.

9. An isolated nucleic acid, wherein the nucleotide sequence of said nucleic
acid is depicted in SEQ ID NO: 1 or SEQ ID NO: 3.



10. An isolated or purified polypeptide which desaturates a fatty acid molecule at
carbon 6 or 12 from the carboxyl end of said polypeptide, wherein said polypeptide
is a eukaryotic polypeptide or is derived from a eukaryotic polypeptide.



11. The isolated or purified eukaryotic polypeptide according to Claim 10,
wherein said eukaryotic polypeptide is derived from a fungus.

12. A nucleic acid comprising: 

a fungal nucleotide sequence which is substantially identical to a sequence of at 
least 50 nucleotides in SEQ ID NO: 1 or SEQ ID NO: 3 or is complementary to a 
sequence of at least 50 nucleotides in SEQ ID NO: 1 or SEQ ID NO: 3. 

13. An isolated nucleic acid having a nucleotide sequence with at least about 
50% homology to SEQ ID NO: 1 or SEQ ID NO: 3. 



14. An isolated nucleic acid having a nucleotide sequence with at least about
50% homology to sequence encoding an amino acid sequence depicted in SEQ ID
NO: 2 or SEQ ID NO: 4.



15. The nucleic acid of claim 14, wherein said amino acid sequence depicted in
SEQ ID NO: 2 is selected from the group consisting of amino acid residues 50-53,
39-43, 172-176, 204-213, and 390-402.



16. A nucleic acid construct comprising:

a nucleotide sequence depicted in a SEQ ID NO: 1 or SEQ ID NO: 3 linked
to a heterologous nucleic acid.

17. A nucleic acid construct comprising:

a nucleotide sequence depicted in a SEQ ID NO: 1 or SEQ ID NO: 3
operably associated with an expression control sequence functional in a microbial
cell.

18. The nucleic acid construct according to Claim 17, wherein said microbial
cell is a yeast cell.

19. The nucleic acid construct according to Claim 17, wherein said nucleotide 
sequence is derived from a fungus. 

20. The nucleic acid construct according to Claim 19, wherein said fungus is of
the genus Mortierella.

21. The nucleic acid construct according to Claim 20, wherein said fungus is of 
the species Mortierella alpina. 

22. A nucleic acid construct comprising:

a fungal nucleotide sequence which encodes a polypeptide comprising an
amino acid sequence which corresponds to or is complementary to an amino acid
sequence depicted in SEQ ID NO: 2 or SEQ ID NO: 4, wherein said nucleic acid is
operably associated with an expression control sequence functional in a microbial
cell, wherein said nucleotide sequence encodes a functionally active polypeptide
which desaturates a fatty acid molecule at carbon 6 or 12 from the carboxyl end of a
fatty acid molecule.

23. A nucleic acid construct comprising:

a nucleotide sequence having an A/T content of less than about 60% which
encodes a functionally active Δ6-desaturase having an amino acid sequence which
corresponds to or is complementary to all of or a portion of an amino acid sequence
depicted in a SEQ ID NO: 2, wherein said nucleotide sequence is operably
associated with a transcription control sequence functional in a yeast cell.

24. A nucleic acid construct comprising:

a fungal nucleotide sequence which encodes a functionally active
Δ12-desaturase having an amino acid sequence which corresponds to or is
complementary to all of or a portion of an amino acid sequence depicted in a SEQ
ID NO: 4, wherein said nucleotide sequence is operably associated with a
transcription control sequence functional in a yeast cell.

25. A recombinant yeast cell comprising: 

a nucleic acid construct according to Claim 23 or Claim 24. 

26. The recombinant yeast cell according to Claim 25, wherein said yeast cell is
a Saccharomyces cell.

27. A recombinant yeast cell comprising:

at least one copy of a vector comprising a fungal nucleotide sequence which
encodes a polypeptide which converts 18:2 fatty acids to 18:3 fatty acids or 18:3
fatty acids to 18:4 fatty acids, wherein said yeast cell or an ancestor of said yeast cell
was transformed with said vector to produce said recombinant yeast cell, and
wherein said nucleotide sequence is operably associated with an expression control
sequence functional in said recombinant yeast cell.

28. The recombinant yeast cell according to claim 27, wherein said
fungal nucleotide sequence is a Mortierella nucleotide sequence.

29. The recombinant yeast cell according to Claim 28, wherein said
recombinant yeast cell is a Saccharomyces cell.

30. The microbial cell according to Claim 27, wherein said expression 
control sequence is provided in said expression vector. 

31. A method for production of GLA in a yeast culture, said method
comprising:

growing a yeast culture having a plurality of recombinant yeast cells,
wherein said yeast cells or an ancestor of said yeast cells were transformed with a
vector comprising fungal DNA encoding a polypeptide which converts LA to GLA,
wherein said DNA is operably associated with an expression control sequence
functional in said yeast cells, under conditions whereby said DNA is expressed,
whereby GLA is produced from LA in said yeast culture.

32. The method according to Claim 31, wherein said fungal DNA is
Mortierella DNA and said polypeptide is a Δ6 desaturase.

33. The method according to Claim 32, wherein Mortierella is of the 
species Mortierella alpina. 

34. The method according to Claim 31, wherein said LA is exogenously
supplied.

35. The method according to Claim 31, wherein said conditions are
inducible.

36. A method for production of stearidonic acid in a yeast culture, said
method comprising:

growing a yeast culture having a plurality of recombinant yeast cells,
wherein said yeast cells or an ancestor of said yeast cells were transformed with a
vector comprising fungal DNA encoding a polypeptide which converts α-linolenic
acid to stearidonic acid, wherein said DNA is operably associated with an expression
control sequence functional in said yeast cells, under conditions whereby said DNA
is expressed, whereby stearidonic acid is produced from α-linolenic acid in said
yeast culture.



37. The method according to Claim 36, wherein said fungal DNA is
Mortierella DNA and said polypeptide is a Δ6 desaturase.



38. The method according to Claim 37, wherein Mortierella is of the 
species Mortierella alpina. 



39. The method according to Claim 36, wherein said α-linolenic acid is
exogenously supplied.



40. The method according to Claim 36, wherein said conditions are 
inducible. 



41. A method for production of linoleic acid in a yeast culture, said
method comprising:

growing a yeast culture having a plurality of recombinant yeast cells,
wherein said yeast cells or an ancestor of said yeast cells were transformed with a
vector comprising fungal DNA encoding a polypeptide which converts oleic acid to
linoleic acid, wherein said DNA is operably associated with an expression control
sequence functional in said yeast cells, under conditions whereby said DNA is
expressed, whereby linoleic acid is produced from oleic acid in said yeast culture.

42. The method according to Claim 41, wherein said fungal DNA is
Mortierella DNA and said polypeptide is a Δ12 desaturase.

43. The method according to Claim 42, wherein Mortierella is of the 
species Mortierella alpina. 

44. The method according to Claim 41, wherein said conditions are
inducible.

45. An isolated or purified polypeptide which desaturates a fatty acid 
molecule at carbon 12 from the carboxyl end of said polypeptide, wherein said 
polypeptide is a fungal polypeptide or is derived from a fungal polypeptide. 

46. The isolated or purified polypeptide according to Claim 46, wherein
said polypeptide is a Mortierella alpina Δ12 desaturase.

47. An isolated or purified polypeptide which desaturates a fatty acid
molecule at carbon 6 from the carboxyl end of said polypeptide, wherein said
polypeptide is a fungal polypeptide or is derived from a fungal polypeptide.

48. The isolated or purified polypeptide according to Claim 48, wherein
said polypeptide is a Δ6 desaturase.



49. An isolated nucleic acid encoding a polypeptide according to Claim
47 or Claim 49.



50. The nucleic acid construct according to Claim 23, wherein said
portion of an amino acid sequence depicted in SEQ ID NO: 2 comprises amino
acids 1 through 457.



51. A host cell comprising:

a nucleic acid construct according to any one of Claims 22 to 24.

52. A host cell comprising:

a vector which includes a nucleic acid which encodes a fatty acid desaturase
derived from Mortierella alpina, wherein said desaturase has an amino acid
sequence represented by SEQ ID NO: 2, and wherein said nucleotide sequence is
operably linked to a promoter.

53. The host cell according to Claim 52, wherein said host cell is a
eukaryotic cell.

54. The host cell according to Claim 53, wherein said eukaryotic cell is
selected from the group consisting of a mammalian cell, a plant cell, an insect cell, a
fungal cell, an avian cell and an algal cell.



55. The host cell according to Claim 54, wherein said host cell is a fungal
cell.

56. The host cell of Claim 21, wherein said promoter is exogenously
supplied to said host cell.



57. A method for production of stearidonic acid in a eukaryotic cell
culture, said method comprising:

growing a eukaryotic cell culture having a plurality of recombinant
eukaryotic cells, wherein said recombinant eukaryotic cells or ancestors of said
recombinant eukaryotic cells were transformed with a vector comprising fungal
DNA encoding a polypeptide which converts α-linolenic acid to stearidonic acid,
wherein said DNA is operably associated with an expression control sequence
functional in said recombinant eukaryotic cells, under conditions whereby said DNA
is expressed, whereby stearidonic acid is produced from α-linolenic acid in said
eukaryotic cell culture.

58. A method for production of linoleic acid in a eukaryotic cell
culture, said method comprising:

growing a eukaryotic cell culture having a plurality of recombinant
eukaryotic cells, wherein said recombinant eukaryotic cells or ancestors of said
recombinant eukaryotic cells were transformed with a vector comprising fungal
DNA encoding a polypeptide which converts oleic acid to linoleic acid, wherein said
DNA is operably associated with an expression control sequence functional in said
recombinant eukaryotic cells, under conditions whereby said DNA is expressed,
whereby linoleic acid is produced from oleic acid in said eukaryotic cell culture.



59. The method according to Claim 57 or Claim 58, wherein said
eukaryotic cells are selected from the group consisting of mammalian cells, plant
cells, insect cells, fungal cells, avian cells and algal cells.

60. The method according to Claim 59, wherein said fungal cells are yeast
cells of the genus Saccharomyces.



61. A recombinant yeast cell comprising:

(1) at least one nucleic acid construct according to Claim 23 or 24; or

(2) at least one nucleic acid construct according to Claim 23 and at
least one nucleic acid construct according to Claim 24.

62. A recombinant yeast cell comprising:

at least one nucleic acid construct comprising a nucleotide sequence which
encodes a functionally active Δ6 desaturase having an amino acid sequence which
corresponds to or is complementary to all or a portion of an amino acid sequence
depicted in SEQ ID NO: 2, and at least one nucleic acid construct comprising a
nucleotide sequence which encodes a functionally active Δ12 desaturase having an
amino acid sequence which corresponds to or is complementary to all or a portion of
an amino acid sequence depicted in SEQ ID NO: 4, wherein said nucleic acid
constructs are operably associated with transcription control sequences functional in
a yeast cell.

63. A method of making GLA, said method comprising:

growing a recombinant yeast cell according to Claim 62 under conditions
whereby said nucleotide sequences are expressed, whereby GLA is produced in said
yeast cell.

64. A method of making GLA, said method comprising:

growing a recombinant yeast cell according to Claim 61 under conditions
whereby the nucleotide sequences in said nucleic acid constructs are expressed,
whereby GLA is produced in said yeast cell.

65. A method for obtaining altered long chain polyunsaturated fatty acid
biosynthesis comprising the steps of:

growing a plant having cells which contain one or more transgenes, derived
from a fungus or algae, which encodes a transgene expression product which
desaturates a fatty acid molecule at a carbon selected from the group consisting of
carbon 6 and carbon 12 from the carboxyl end of said fatty acid molecule, wherein
said one or more transgenes is operably associated with an expression control
sequence, under conditions whereby said one or more transgenes is expressed,
whereby long chain polyunsaturated fatty acid biosynthesis in said cells is altered.

66. The method according to claim 65, wherein said long chain
polyunsaturated fatty acid is selected from the group consisting of 18:1ω9, LA,
GLA, SDA and ALA.

67. A microbial oil or fraction thereof produced according to the method
of claim 65.

68. A method of treating or preventing malnutrition comprising
administering said microbial oil of claim 67 to a patient in need of said treatment or
prevention in an amount sufficient to effect said treatment or prevention.

69. A pharmaceutical composition comprising said microbial oil or
fraction of claim 67 and a pharmaceutically acceptable carrier.

70. The pharmaceutical composition of claim 69, wherein said
pharmaceutical composition is in the form of a solid or a liquid.

71. The pharmaceutical composition of claim 70, wherein said
pharmaceutical composition is in a capsule or tablet form.

72. The pharmaceutical composition of claim 69 further comprising at
least one nutrient selected from the group consisting of a vitamin, a mineral, a
carbohydrate, a sugar, an amino acid, a free fatty acid, a phospholipid, an
antioxidant, and a phenolic compound.



73. A nutritional formula comprising said microbial oil or fraction 
thereof of claim 67. 



74. The nutritional formula of claim 73, wherein said nutritional formula
is selected from the group consisting of an infant formula, a dietary supplement, and
a dietary substitute.



75. The nutritional formula of claim 74, wherein said infant formula,
dietary supplement or dietary supplement is in the form of a liquid or a solid.



76. An infant formula comprising said microbial oil or fraction thereof of 
claim 67. 



77. The infant formula of claim 76 further comprising at least one
macronutrient selected from the group consisting of coconut oil, soy oil, canola oil,
mono- and diglycerides, glucose, edible lactose, electrodialysed whey,
electrodialysed skim milk, milk whey, soy protein, and other protein hydrolysates.



78. The infant formula of claim 77 further comprising at least one
vitamin selected from the group consisting of Vitamins A, C, D, E, and B complex;
and at least one mineral selected from the group consisting of calcium, magnesium,
zinc, manganese, sodium, potassium, phosphorus, copper, chloride, iodine,
selenium, and iron.

79. A dietary supplement comprising said microbial oil or fraction
thereof of claim 67.



80. The dietary supplement of claim 79 further comprising at least one
macronutrient selected from the group consisting of coconut oil, soy oil, canola oil,
mono- and diglycerides, glucose, edible lactose, electrodialysed whey,
electrodialysed skim milk, milk whey, soy protein, and other protein hydrolysates.

81. The dietary supplement of claim 80 further comprising at least one
vitamin selected from the group consisting of Vitamins A, C, D, E, and B complex;
and at least one mineral selected from the group consisting of calcium, magnesium,
zinc, manganese, sodium, potassium, phosphorus, copper, chloride, iodine,
selenium, and iron.

82. The dietary supplement of claim 79 or claim 81, wherein said dietary 
supplement is administered to a human or an animal. 



83. A dietary substitute comprising said microbial oil or fraction thereof
of claim 67.

84. The dietary substitute of claim 83 further comprising at least one
macronutrient selected from the group consisting of coconut oil, soy oil, canola oil,
mono- and diglycerides, glucose, edible lactose, electrodialysed whey,
electrodialysed skim milk, milk whey, soy protein, and other protein hydrolysates.

85. The dietary substitute of claim 84 further comprising at least one
vitamin selected from the group consisting of Vitamins A, C, D, E, and B complex;
and at least one mineral selected from the group consisting of calcium, magnesium,
zinc, manganese, sodium, potassium, phosphorus, copper, chloride, iodine,
selenium, and iron.



86. The dietary substitute of claim 83 or claim 85, wherein said dietary
substitute is administered to a human or animal.



87. A method of treating a patient having a condition caused by
insufficient intake or production of polyunsaturated fatty acids comprising
administering to said patient said dietary substitute of claim 83 or said dietary
supplement of claim 79 in an amount sufficient to effect said treatment.



88. The method of claim 87, wherein said dietary substitute or said 
dietary supplement is administered enterally or parenterally. 



89. A cosmetic comprising said microbial oil or fraction thereof of claim
67.



90. The cosmetic of claim 88, wherein said cosmetic is applied topically. 

91. The pharmaceutical composition of claim 69, wherein said
pharmaceutical composition is administered to a human or an animal.



92. An animal feed comprising said microbial oil or fraction thereof of 
claim 67. 



93. The method of claim 20 wherein said fungus is Mortierella species.

94. The method of claim 93 wherein said fungus is Mortierella alpina.

95. An isolated peptide sequence selected from the group consisting of 
SEQ ID NO:34 - SEQ ID NO:40. 

96. An isolated peptide sequence selected from the group consisting of
SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:25 and SEQ ID NO:26.

97. A method for production of gamma-linolenic acid in a eukaryotic cell
culture, said method comprising:

growing a eukaryotic cell culture having a plurality of recombinant
eukaryotic cells, wherein said recombinant eukaryotic cells or ancestors of said
recombinant eukaryotic cells were transformed with a vector comprising fungal
DNA encoding a polypeptide which converts linoleic acid to gamma-linolenic acid,
wherein said DNA is operably associated with an expression control sequence
functional in said recombinant eukaryotic cells, under conditions whereby said DNA
is expressed, whereby gamma-linolenic acid is produced from linoleic acid in said
eukaryotic cell culture.

98. The method according to Claim 97 wherein said eukaryotic cells are
selected from the group consisting of mammalian cells, plant cells, insect cells,
fungal cells, avian cells and algal cells.
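
Claims 5 and 23 characterize the nucleotide sequence by an A/T content of less than about 60%. For illustration only, the following Python sketch shows one way such a percentage could be computed from a DNA string. The function name `at_content` and the decision to ignore any character other than A, C, G and T are assumptions of this sketch, not something the claims specify.

```python
def at_content(seq: str) -> float:
    """Return the percentage of A and T among the A/C/G/T bases in seq.

    Characters other than A, C, G and T (for example N or whitespace)
    are ignored; this is an assumption made for the sketch.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    at_count = sum(1 for b in bases if b in "AT")
    return 100.0 * at_count / len(bases)


# Toy input, not taken from SEQ ID NO: 1 or SEQ ID NO: 3.
print(at_content("ATGC"))  # 50.0
```

A sequence would fall within the recited range under this sketch when `at_content(seq)` returns a value below about 60.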

FastA Match of ma29 and contig 253538a
SCORES Init1: 117 Initn: 225 Opt: 256
Smith-Waterman score: 408; 27.0% identity in 441 aa overlap

FastA Match of ma524 and contig 253533a
SCORES Init1: 231 Initn: 499 Opt: 401
Smith-Waterman score: 620; 27.3% identity in 455 aa overlap
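
The FastA summaries above report percent identity over an aligned overlap. As a rough illustration of how such a figure can be derived, the Python sketch below computes percent identity from two sequences that are already aligned to equal length, with `-` marking gaps. The function name `percent_identity`, the gap character, and the convention of counting every alignment column in the denominator are assumptions of this sketch; FastA's own accounting may differ.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Both inputs must already be aligned (equal length, gaps inserted).
    Columns where either row has a gap never count as identical.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment, not taken from the ma29 or ma524 comparisons.
print(percent_identity("ACDE-", "ACDFG"))  # 60.0
```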